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To My Parents 




Preface 


This book is composed of two parts: Part I (Chaps. 1 through 3) is an introduction to 
tensors and their physical applications, and Part II (Chaps. 4 through 6) introduces 
group theory and intertwines it with the earlier material. Both parts are written at the 
advanced-undergraduate/beginning-graduate level, although in the course of Part II 
the sophistication level rises somewhat. Though the two parts differ somewhat in 
flavor, I have aimed in both to fill a (perceived) gap in the literature by connecting 
the component formalisms prevalent in physics calculations to the abstract but more 
conceptual formulations found in the math literature. My firm belief is that we need 
to see tensors and groups in coordinates to get a sense of how they work, but also 
need an abstract formulation to understand their essential nature and organize our 
thinking about them. 

My original motivation for the book was to demystify tensors and provide a uni- 
fied framework for understanding them in all the different contexts in which they 
arise in physics. The word tensor is ubiquitous in physics (stress tensor, moment- 
of-inertia tensor, field tensor, metric tensor, tensor product, etc.) and yet tensors are 
rarely defined carefully, and the definition usually has to do with transformation 
properties, making it difficult to get a feel for what these objects are. Furthermore, 
physics texts at the beginning graduate level usually only deal with tensors in their 
component form, so students wonder what the difference is between a second rank 
tensor and a matrix, and why new, enigmatic terminology is introduced for some- 
thing they have already seen. All of this produces a lingering unease, which I believe 
can be alleviated by formulating tensors in a more abstract but conceptually much 
clearer way. This coordinate-free formulation is standard in the mathematical liter- 
ature on differential geometry and in physics texts on General Relativity, but as far 
as I can tell is not accessible to undergraduates or beginning graduate students in 
physics who just want to learn what a tensor is without dealing with the full ma- 
chinery of tensor analysis on manifolds. 

The irony of this situation is that a proper understanding of tensors does not 
require much more mathematics than what you likely encountered as an undergrad- 
uate. In Chap. 2 I introduce this additional mathematics, which is just an extension 
of the linear algebra you probably saw in your lower-division coursework. This ma- 
terial sets the stage for tensors, and hopefully also illuminates some of the more 
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enigmatic objects from quantum mechanics and relativity, such as bras and kets, 
covariant and contravariant components of vectors, and spherical harmonics. After 
laying the necessary linear algebraic foundations, we give in Chap. 3 the modern 
(component-free) definition of tensors, all the while keeping contact with the coor- 
dinate and matrix representations of tensors and their transformation laws. Applica- 
tions in classical and quantum physics follow. 

In Part II of the book I introduce group theory and its physical applications, which 
is a beautiful subject in its own right and also a nice application of the material in 
Part I. There are many good books on the market for group theory and physics (see 
the references), so rather than be exhaustive 1 have just attempted to present those 
aspects of the subject most essential for upper-division and graduate-level physics 
courses. In Chap. 4 I introduce abstract groups, but quickly illustrate that concept 
with myriad examples from physics. After all, there would be little point in making 
such an abstract definition if it did not subsume many cases of interest! We then 
introduce Lie groups and their associated Lie algebras, making precise the nature of 
the symmetry ‘generators’ that are so central in quantum mechanics. Much time is 
also spent on the groups of rotations and Lorentz transformations, since these are so 
ubiquitous in physics. 

In Chap. 5 I introduce representation theory, which is a mathematical formaliza- 
tion of what we mean by the ‘transformation properties’ of an object. This subject 
sews together the material from Chaps. 3 and 4, and is one of the most important 
applications of tensors, at least for physicists. Chapter 6 then applies and extends 
the results of Chap. 5 to a few specific topics: the perennially mysterious ‘spheri- 
cal’ tensors, the Wigner-Eckart theorem, and Dirac bilinears. The presentation of 
these later topics is admittedly somewhat abstract, but I believe that the mathemati- 
cally precise treatment yields insights and connections not usually found in the usual 
physicist’s treatment of the subjects. 

This text aims (perhaps naively!) to be simultaneously intuitive and rigorous. 
Thus, although much of the language (especially in the examples) is informal, al- 
most all the definitions given are precise and are the same as one would find in 
a pure math text. This may put you off if you feel less mathematically inclined; I 
hope, however, that you will work through your discomfort and develop the neces- 
sary mathematical sophistication, as the results will be well worth it. Furthermore, 
if you can work your way through the text (or at least most of Chap. 5), you will be 
well prepared to tackle graduate math texts in related areas. 

As for prerequisites, it is assumed that you have been through the usual under- 
graduate physics curriculum, including a “mathematical methods for physicists” 
course (with at least a cursory treatment of vectors and matrices), as well as the 
standard upper-division courses in classical mechanics, quantum mechanics, and 
relativity. Any undergraduate versed in those topics, as well as any graduate stu- 
dent in physics, should be able to read this text. To undergraduates who are eager to 
learn about tensors but have not yet completed the standard curriculum, I apologize; 
many of the examples and practically all of the motivation for the text come from 
those courses, and to assume no knowledge of those topics would preclude discus- 
sion of the many applications that motivated me to write this book in the first place. 
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However, if you are motivated and willing to consult the references, you could cer- 
tainly work through this text, and would no doubt be in excellent shape for those 
upper-division courses once you take them. 

Exercises and problems are included in the text, with exercises occurring within 
the chapters and problems occurring at the end of each chapter. The exercises in 
particular should be done as they arise, or at least carefully considered, as they often 
flesh out the text and provide essential practice in using the definitions. Very few of 
the exercises are computationally intensive, and many of them can be done in a few 
lines. They are designed primarily to test your conceptual understanding and help 
you internalize the subject. Please do not ignore them! 

Besides the aforementioned prerequisites I have also indulged in the use of some 
very basic mathematical shorthand for brevity’s sake; a guide is below. Also, be 
aware that for simplicity’s sake I have set all physical constants such as c and h 
equal to 1. Enjoy! 

Berkeley, USA Nadir Jeevanjee 
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Notation 


Some Mathematical Shorthand 
R The set of real numbers 

C The set of complex numbers 

Z The set of positive and negative integers 

e “is an element of”, “an element of”, i.e. 2 e R reads “2 is an 

element of the real numbers” 

(f: “is not an element of” 


V 

c 


f :a\-+ b 


Ax B 


R" 


“for all” 

“is a subset of”, “a subset of" 

Denotes a definition 

Denotes a map / that takes elements of the set A into 
elements of the set B 

Indicates that the map / sends the element a to the element b 
Denotes a composition of maps, i.e. if f : A— > B and 
g\ B C, then go f : A — >• C is given by 

(g ° f)(a) = gif (a)) 

The set {(a, b)} of all ordered pairs where a e A, b e B . 
Referred to as the cartesian product of sets A and B. 

Extends in the obvious way to n-fold products A\ x • • • x A n 

lx-xl 


C" 

{A\Q} 

□ 


n times 

C x • • • x C 

v V- ' 

n times 

Denotes a set A subject to condition Q. For instance, the set 
of all even integers can be written as [x e R | x/2 e Z} 
Denotes the end of a proof or example 
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Notation 


Dirac Dictionary 1 

Standard Notation Dirac Notation 


Vector \j/ e H 
Dual Vector L(i/r) 
Inner Product (i jr, <p) 
A e C{H) 

(f, A<j>) 

Ti J e' ® ej 

ei ®ej 


\f) 

W I 
(l/ r \<t> ) 
A\f) 
W\A\4>) 


y, i: Tij\M\ 

\i)\j) or | i,j) 


*We summarize here all of the translations given in the text between quantum-mechanical Dirac 
notation and standard mathematical notation. 
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Chapter 1 

A Quick Introduction to Tensors 


The reason tensors are introduced in a somewhat ad-hoc manner in most physics 
courses is twofold: first, a detailed and proper understanding of tensors requires 
mathematics that is slightly more abstract than the standard linear algebra and vec- 
tor calculus that physics students use everyday. Second, students do not necessarily 
need such an understanding to be able to manipulate tensors and solve problems 
with them. The drawback, of course, is that many students feel uneasy whenever 
tensors are discussed, and they find that they can use tensors for computation but 
do not have an intuitive feel for what they are doing. One of the primary aims of 
this book is to alleviate those feelings. Doing that, however, requires a modest in- 
vestment (about 30 pages) in some abstract linear algebra, so before diving into the 
details we will begin with a rough overview of what a tensor is, which hopefully 
will whet your appetite and tide you over until we can discuss tensors in full detail 
in Chap. 3. 

Many older books define a tensor as a collection of objects which carry indices 
and which ‘transform’ in a particular way specified by those indices. Unfortunately, 
this definition usually does not yield much insight into what a tensor is. One of the 
main purposes of the present text is to promulgate the more modern definition of 
a tensor, which is equivalent to the old one but is more conceptual and is in fact 
already standard in the mathematics literature. This definition takes a tensor to be a 
function which eats a certain number of vectors (known as the rank r of the tensor) 
and produces a number. The distinguishing characteristic of a tensor is a special 
property called multilinearity, which means that it must be linear in each of its r 
arguments (recall that linearity for a function with a single argument just means 
that T(y + cw ) = T(y) + cT (w) for all vectors v and w and numbers c). As we 
will explain in a moment, this multilinearity enables us to express the value of the 
function on an arbitrary set of r vectors in terms of the values of the function on 
r basis vectors like x, y, and z. These values of the function on basis vectors are 
nothing but the familiar components of the tensor, which in older treatments are 
usually introduced first as part of the definition of the tensor. 

To make this concrete, consider a rank 2 tensor T, whose job it is to eat two 
vectors v and w and produce a number which we will denote as T(v,w). For such 
a tensor, multilinearity means 

N. Jeevanjee, An Introduction to Tensors and Group Theory for Physicists, 3 
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1 A Quick Introduction to Tensors 


T(V[ + CV 2 , w) — T{v\, w) + cT(v 2 , w) (1.1) 

T(v, Wi + cw 2 ) = T(y, Wi) + cT(y , W 2 ) (1.2) 

for any number c and all vectors v and w . This means that if we have a coordinate 
basis for our vector space, say x, y and z, then T is determined entirely by its values 
on the basis vectors, as follows: first, expand v and w in the coordinate basis as 

v = v x x + v y y + v z z 
w = w x x + w y y + w z z. 

Then by (1.1) and (1.2) we have 

T (v, w)=T (v x x + v y y + v z z, w x x + w y y + w z z) 

= v x T(x, w x x + w y y + w z z) + v y T( y, w x x + w y y + w z z ) 

+ v z T( z, w x x + w y y + w z z ) 

= v x w x T(x,x) + v x w y T (x, y) + v x w z T (x, z) + v y w x T (y, x) 

+ VyWyTiy, y) + v y w z T(y, z) + v z w x T(z,x) + v z w y T( z, y) 

+ v z w z T(z, z). 

If we then abbreviate our notation as 

T xx = T (x, x) 

T xy = T (x, y) (1.3) 

T yx = T (y, x) 

and so on, we have 

T(v, w) = v x w x T xx + v x w y T xy + v x w z T xz + v y w x T yx + v y w y T y y 

+ v y w z T yz + v z w x T zx + v z i VyT zy + v Z W Z T zz (1-4) 

which may look familiar from discussion of tensors in the physics literature. In that 
literature, the above equation is often part of the definition of a second rank tensor; 
here, though, we see that its form is really just a consequence of multilinearity. 
Another advantage of our approach is that the components { T xx , T x v , T xz , . . .} of 
T have a meaning beyond that of just being coefficients that appear in expressions 
like (1.4); from (1.3), we see that components are the values of the tensor when 
evaluated on a given set of basis vectors. This fact is crucial in getting a feel for 
tensors and what they mean. 

Another nice feature of our definition of a tensor is that it allows us to derive 
the tensor transformation laws which historically were taken as the definition of a 
tensor. Say we switch to a new set of basis vectors {x\ y, z'j which are related to 
the old basis vectors by 

x' = A x , x x+ Ax' y y + A x , z z 

y = Ay' X X + Ay'yY Ay' (1*5) 

z = A.£>xX.-\- A z f y y H - A z f z z. 
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This does not affect the action of T , since T exists independently of any basis, but 
if we would like to compute the value of T( v, w) in terms of the new components 
( v ' x , v' y , v ' z ) and (w' x , w' w' z ) of v and w, then we will need to know what the new 
components {T x t x t, T x > y i , T x <-j , . . .} look like. Computing T x ' x ’, for instance, gives 

T X ' X > = T(x,x) 

= T ( A x f x x + A x i y y + A x i z z, A x , x x + A x , y y + A x < z z) 

= A x r x A x ' x T XX + A x >yA x ’ x Ty X T A X ' Z A X I X T ZX + A x l x A x lyT X y 

+ A x 'yA x 'yTyy + Ajy^A^yT^ + A % t X Atf fT XZ + A x > y A x ' z Tyv 

+ A x r z A x r~T zz . (1.6) 

You probably recognize this as an instance of the standard tensor transformation 
law, which used to be taken as the definition of a tensor. Here, the transformation 
law is another consequence of multilinearity! In Chap. 3 we will introduce more 
convenient notation, which will allow us to use the Einstein summation convention, 
so that we can write the general form of (1.6) as 

T i , j , = A k i ,A l j ,T kh (1.7) 

a form which may be more familiar to some readers. 

One common source of confusion is that in physics textbooks, tensors (usually 
of the second rank) are often represented as matrices, so then the student wonders 
what the difference is between a matrix and a tensor. Above, we have defined a 
tensor as a multilinear function on a vector space. What does that have to do with 
a matrix? Well, if we choose a basis {x, y, z}, we can then write the corresponding 
components {T xx , T X y, T xz , . . .} in the form of a matrix [7] as 



Equation (1.4) can then be written compactly as 


T xx 

T 

1 xy 

Txz\ 

(w x \ 


T(v, w ) = ( v x , v y , v z ) T yx 

Tyy 

T yz 

Wy 

(1.9) 

\T ZX 

Tzy 

T z -J 

w 



where the usual matrix multiplication is implied. Thus, once a basis is chosen, the 
action of T can be neatly expressed using the corresponding matrix [T], It is crucial 
to keep in mind, though, that this association between a tensor and a matrix depends 
entirely on a choice of basis, and that [ T ] is useful mainly as a computational tool, 
not a conceptual handle. T is best thought of abstractly as a multilinear function, 
and [T] as its representation in a particular coordinate system. 

One possible objection to our approach is that matrices and tensors are often 
thought of as linear operators which take vectors into vectors, as opposed to objects 
which eat vectors and spit out numbers. It turns out, though, that for a second rank 
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tensor these two notions are equivalent. Say we have a linear operator R: then we 
can turn R into a second rank tensor Tr by 

Tr(v,w) = v-Rw (1.10) 

where ■ denotes the usual dot product of vectors. You can roughly interpret this 
tensor as taking in v and w and giving back the “component of Rw lying along v”. 
You can also easily check that Tr is multilinear, i.e. it satisfies (1.1) and (1.2). If we 
compute the components of Tr we find that, for instance, 

(Tr)xx = Tr(x,x) 

= x Rx 

= X • (Rxx* + Rxy y + Rxzi) 

= R\x 

so the components of the tensor Tr are the same as the components of the linear 
operator Rl In components, the action of Tr then looks like 


( Rxx 

Rxy 

R xz \ 

( Wx \ 


Tr(V, w) = (v x , Vy, V-) Ryx 

Ryy 


Wy 

(1.11) 

V R ZX 

Rzy 

RzzJ 

w 



which is identical to ( 1 .9). This makes it obvious how to turn a linear operator R into 
a second rank tensor — just sandwich the component matrix in between two vectors! 
This whole process is also reversible: we can turn a second rank tensor T into a 
linear operator Rj by defining the components of the vector Rj(v) as 

(Rt(v)) x = T(v,x), (1.12) 

and similarly for the other components. One can check that these processes are 
inverses of each other, so this sets up a one-to-one correspondence between linear 
operators and second rank tensors and we can thus regard them as equivalent. Since 
the matrix form of both is identical, one often does not bother trying to clarify 
exactly which one is at hand, and often times it is just a matter of interpretation. 

How does all this work in a physical context? One nice example is the rank 2 
moment-of-inertia tensor I, familiar from rigid body dynamics in classical mechan- 
ics. In most textbooks, this tensor usually arises when one considers the relationship 
between the angular velocity vector co and the angular momentum L or kinetic en- 
ergy KE of a rigid body. If one chooses a basis {x, y, z} and then expresses, say, KE 
in terms of co, one gets an expression like (1.4) with v = w — co and Tj = 2 )j. From 
this expression, most textbooks then figure out how the 1 must transform under a 
change of coordinates, and this behavior under coordinate change is then what iden- 
tifies the Zj as a tensor. In this text, though, we will take a different point of view. 
We will define T to be the function which, for a given rigid body, assigns to a state 
of rotation (described by the angular momentum vector co) twice the corresponding 
kinetic energy KE. One can then calculate KE in terms of co, which yields an ex- 
pression of the form (1.4); this then shows that T is a tensor in the old sense of the 
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term, which also means that it is a tensor in the modern sense of being a multilinear 
function. We can then think of X as a second rank tensor defined by 

1 

KE=-X(co,co). (1.13) 

From our point of view, we see that the components of X have a physical meaning; 
for instance, I xx = X(k. x) is just twice the kinetic energy of the rigid body when 
a> — x. When one actually needs to compute the kinetic energy for a rigid body, 
one usually picks a convenient coordinate system, computes the components of X 
using the standard formulas, 1 and then uses the convenient matrix representation to 
compute 


1 

KE= -(co x ,CD y ,co z ) 

(Ixx 

Ixy 

Ixz\ 

lyx 

Xyy 

lyz 


\Izx 

Izy 

Izz j 



(1.14) 


which is the familiar expression often found in mechanics texts. 

As mentioned above, one can also interpret X as a linear operator which takes 
vectors into vectors. If we let X act on the angular velocity vector, we get 


(Ixx 

J 

- L xy 

Ixz\ 

(<Vx\ 


lyx 

lyy 

lyz 

1 ° Jy I 

(1.15) 

\Izx 

Izy 

Izz ) 

\Uz) 



which you probably recognize as the coordinate expression for the angular momen- 
tum L . Thus, X can be interpreted either as the rank 2 tensor which eats two copies 
of the angular velocity vector co and produces the kinetic energy, or as the linear 
operator which takes w into L. The two definitions are equivalent. 

Many other tensors which we will consider in this text are also nicely understood 
as multilinear functions which happen to have convenient component and matrix 
representations. These include the electromagnetic field tensor of electrodynam- 
ics, as well as the metric tensors of Newtonian and relativistic mechanics. As we 
progress we will see how we can use our new point of view to get a handle on these 
usually somewhat enigmatic objects. 


1 See Example 3 . 1 4 or any standard textbook such as Goldstein [ 6] . 



Chapter 2 

Vector Spaces 


Since tensors are a special class of functions defined on vector spaces, we must 
have a good foundation in linear algebra before discussing them. In particular, you 
will need a little bit more linear algebra than is covered in most sophomore or ju- 
nior level linear algebra/ODE courses. This chapter starts with the familiar material 
about vectors, bases, linear operators etc. but eventually moves on to slightly more 
sophisticated topics that are essential for understanding tensors in physics. As we 
lay this groundwork, hopefully you will find that our slightly more abstract view- 
point also clarifies the nature of many of the objects in physics you have already 
encountered. 


2.1 Definition and Examples 

We begin with the definition of an abstract vector space. We are taught as under- 
graduates to think of vectors as arrows with a head and a tail, or as ordered triples 
of real numbers, but physics, and especially quantum mechanics, requires a more 
abstract notion of vectors. Before reading the definition of an abstract vector space, 
keep in mind that the definition is supposed to distill all the essential features of 
vectors as we know them (like addition and scalar multiplication) while detaching 
the notion of a vector space from specific constructs, like ordered n -tuples of real or 
complex numbers (denoted as R" and C", respectively). The mathematical utility of 
this is that much of what we know about vector spaces depends only on the essential 
properties of addition and scalar multiplication, not on other properties particular to 
R" or C" . If we work in the abstract framework and then come across other math- 
ematical objects that do not look like R" or C" but that are abstract vector spaces, 
then most everything we know about R' ! and C” will apply to these spaces as well. 
Physics also forces us to use the abstract definition since many quantum-mechanical 
vector spaces are infinite-dimensional and cannot be viewed as C” or R" for any n . 
An added dividend of the abstract approach is that we will learn to think about vector 
spaces independently of any basis, which will prove very useful. 

N. Jeevanjee, An Introduction to Tensors and Group Theory for Physicists, 9 
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2 Vector Spaces 


That said, an ( abstract ) vector space is a set V (whose elements are called vec- 
tors), together with a set of scalars C (for us, C is always R or C) and operations of 
addition and scalar multiplication that satisfy the following axioms: 

1. v + w = w + v for all v, w in V (Commutativity) 

2. v + (w + x) = (v + w) + x for all v , w, x in V (Associativity) 

3. There exists a vector 0 in V such that v + 0 = v for all v in V 

4. For all v in V there is a vector — v such that v + (— v) = 0 

5. c(v + w) — cv + cw for all v and w in V and scalars c (Distributivity) 

6. lv — v for all v in V 

7. (cj + C 2 )v = c\v + civ for all scalars ci, cj and vectors v 

8. (c\C 2 )v = ci(c 2 v) for all scalars c\, C 2 and vectors v 

Some parts of the definition may seem tedious or trivial, but they are just meant 
to ensure that the addition and scalar multiplication operations behave the way we 
expect them to. In determining whether a set is a vector space or not, one is usually 
most concerned with defining addition in such a way that the set is closed under 
addition and that axioms 3 and 4 are satisfied; most of the other axioms are so 
natural and obviously satisfied that one, in practice, rarely bothers to check them. 
That said, let us look at some examples from physics, most of which will recur 
throughout the text. 

Example 2.1 R" 

This is the most basic example of a vector space, and the one on which the abstract 
definition is modeled. Addition and scalar multiplication are defined in the usual 
way: for v = (u 1 , ir, . . . , i>"), w — (w 1 , w 2 , . . . , w n ) in R", we have 

(u 1 , v 2 , v n ) + (in 1 , w 2 , . . . , w") = (u 1 + w 1 , v 2 + w 2 , ...,v n + w n ) 

and 

c(u 1 , v 2 , v n ) = (cv 1 ,cv 2 , . . . , cv 

You should check that the axioms are satisfied. These spaces, of course, are basic 
in physics; R 3 is our usual three-dimensional cartesian space, R 4 * is spacetime in 
special relativity, and R" for higher n occurs in classical physics as configuration 
spaces for multiparticle systems (for example, R 6 is the configuration space in the 


1 Another word about axioms 3 and 4, for the mathematically inclined (feel free to skip this if you 

like): the axioms do not demand that the zero element and inverses are unique, but this actually 

follows easily from the axioms. If 0 and 0' are two zero elements, then 

0 = 0 + O' = 0' 

and so the zero element is unique. Similarly, if —v and —v' are both inverse to some vector v, then 
—v' = — v' + 0 = —v' + (v — v) = (—v' + v) — v = 0 — v = —v 


and so inverses are unique as well. 
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classic two-body problem, as you need six coordinates to specify the position of two 
particles in three-dimensional space). 

Example 2.2 C" 

This is another basic example — addition and scalar multiplication are defined as 
for R", and the axioms are again straightforward to verify. Note, however, that C" is 
a complex vector space, i.e. the set C in the definition is C so scalar multiplication by 
complex numbers is defined, whereas R" is only a real vector space. This seemingly 
pedantic distinction can often end up being significant, as we will see. 2 C" occurs 
in physics primarily as the ket space for finite-dimensional quantum-mechanical 
systems, such as particles with spin but without translational degrees of freedom. 
For instance, a spin 1/2 particle fixed in space has ket space identifiable with C 2 , 
and more generally a fixed particle with spin s has ket space identifiable with C 2,v+I . 

Example 2.3 M n (R) and M n (C), n x n matrices with real or complex entries 

The vector space structure of M„ (R) and M n (C) is similar to that of R" and C" : 
denoting the entry in the i th row and / th column of a matrix A as Ajj, we define 
addition and (real) scalar multiplication for A, B e M n (R) by 


(A + B)jj — Ajj + B,j 
(cA)ij — c A ij 

i.e. addition and scalar multiplication are done component-wise. The same defini- 
tions are used for M n (C), which is of course a complex vector space. You can again 
check that the axioms are satisfied. Though these vector spaces do not appear ex- 
plicitly in physics very often, they have many important subspaces, one of which we 
consider in the next example. 

Example 2.4 H n (C), n x n Hermitian matrices with complex entries 

H n (C), the set of all n x n Hermitian matrices, 3 is obviously a subset of M n (C), and 
in fact it is a sub space of M„ (C) in that it forms a vector space itself. 4 To show this it 
is not necessary to verify all of the axioms, since most of them are satisfied by virtue 
of II n ((C) being a subset of M„(C); for instance, addition and scalar multiplication 
in H n (C) are just given by the restriction of those operations in M„ (C) to H n (C), so 


2 Note also that any complex vector space is also a real vector space, since if you know how to 
multiply vectors by a complex number, then you certainly know how to multiply them by a real 
number. The converse, of course, is not true. 

^Hermitian matrices being those which satisfy A 4 = (A T )* = A, where superscript T denotes the 
transpose and superscript * denotes complex conjugation of the entries. 

4 Another footnote for the mathematically inclined: as discussed later in this example, though, 
H„ ( C ) is only a real vector space, so it is only a subspace of M„ (C) when M„ (C) is considered as 
a real vector space, cf. footnote 2. 
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the commutativity of addition and the distributivity of scalar multiplication over ad- 
dition follow immediately. What does remain to be checked is that H n (C) is closed 
under addition and contains the zero “vector” (in this case, the zero matrix), both 
of which are easily verified. One interesting thing about //„ ((C) is that even though 
the entries of its matrices can be complex, it does not form a complex vector space; 
multiplying a Hermitian matrix by i yields an anfi'-Hermitian matrix, so //„ (C) is 
not closed under complex scalar multiplication. As far as physical applications go, 
we know that physical observables in quantum mechanics are represented by Her- 
mitian operators, and if we are dealing with a finite-dimensional ket space such as 
those mentioned in Example 2.2 then observables can be represented as elements of 
H„ (C). As an example one can take a fixed spin 1 /2 particle whose ket space is C 2 ; 
the angular momentum operators are then represented as L , = jCT;, where the a,- are 
the Hermitian Pauli matrices 



Example 2.5 Z, 2 ([a, b ] ) , square-integrable complex-valued functions on an interval 

This example is fundamental in quantum mechanics. A complex-valued function / 
on [a , b] C R is said to be square-integrable if 

\f(x)\~dx<oo. (2.2) 

Defining addition and scalar multiplication in the obvious way, 

(f + g) O) = f(x) + g(x) 

( cf)(x ) = cf(x), 

and taking the zero element to be the function which is identically zero (i.e. 
fix) = 0 for all x) yields a complex vector space. (Note that if we considered only 
real-valued functions then we would only have a real vector space.) Verifying the ax- 
ioms is straightforward though not entirely trivial, as one must show that the sum of 
two square-integrable functions is again square-integrable (Problem 2.1). This vec- 
tor space arises in quantum mechanics as the set of normalizable wavefunctions for 
a particle in a one-dimensional infinite potential well. Later on we will consider the 
more general scenario where the particle may be unbound, in which case a = — oo 
and b — oo and the above definitions are otherwise unchanged. This vector space is 
denoted as L 2 (R). 

Example 2.6 7T/(R 3 ) and Tli, the harmonic polynomials and the spherical harmon- 
ics 

Consider the set P/(R 3 ) of all complex-coefficient polynomial functions on R 3 of 
fixed degree /, i.e. all linear combinations of functions of the form x l yi z k where 
i + j + k = l. Addition and (complex) scalar multiplication are defined in the usual 
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way and the axioms are again easily verified, so ,P/(R 3 ) is a vector space. Now 
consider the vector subspace Hi(M. 3 ) C P/(R 3 ) of harmonic degree / polynomials, 
i.e. degree / polynomials satisfying A / = 0, where A is the usual three-dimensional 
Laplacian. You may be surprised to learn that the spherical harmonics of degree / 
are essentially elements of 7Y/(R 3 )! To see the connection, note that if we write a 
degree / polynomial 


f(x,y,z)= ^ Cijkx‘y j z k 

i,j,k 

i+j+k=l 

in spherical coordinates with polar angle 9 and azimuthal angle (p , we will get 

f(r,9,<p) = r l Y(9,(p ) 


for some function Y(9,(p), which is just the restriction of / to the unit sphere. If we 
write the Laplacian out in spherical coordinates we get 


9 2 2 9 

dr - ^ r dr 



(2.3) 


where A S 2 is shorthand for the angular part of the Laplacian. 5 You will show be- 
low that applying this to f(r,9,cp) — r l Y (6, if>) and demanding that / be harmonic 
yields 


A S 2Y(9, <p) — —1(1 + 1)Y(0, </>), (2.5) 

which is the definition of a spherical harmonic of degree / ! Conversely, if we take 
a degree I spherical harmonic and multiply it by r , the result is a harmonic func- 
tion. If we let Hi denote the set of all spherical harmonics of degree /, then we have 
established a one-to-one correspondence between Hi and Hi(M. 3 ) \ The correspon- 
dence is given by restricting functions in Hi(M. 3 ) to the unit sphere, and conversely 
by multiplying functions in Hi by r l , i.e. 

H,(R 3 ) 

f — ► /(r = 1,0,0) 
r'y(0,0) < — 7(0,0). 

In particular, this means that the familiar spherical harmonics Y l m (9, (p) are 

just the restriction of particular harmonic degree / polynomials to the unit 


5 The differential operator A S 2 is also sometimes known as the spherical Laplacian, and is given 
explicitly by 


A S 2 


ae 2 


+ cot(? 


a 

30 


i a 2 

sin 2 d a (p 2 


(2.4) 


We will not need the explicit form of A S 2 here. A derivation and further discussion can be found 
in any electrodynamics or quantum mechanics book, like Sakurai [14]. 
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sphere. For instance, consider the case 1 = 1. Clearly 'H\ (R 3 ) = P\ (R 3 ) since all 
first-degree (linear) functions are harmonic. If we write the functions 


x + iy 


Z, 


x — iy 

~7T 


eH i(M 3 ) 


in spherical coordinates, we get 


re 1 ^ sin 9, rcos9, ~j=re sin0 e TL\ (K 3 ) 

v 2 V 2 

which when restricted to the unit sphere yield 


-^—e 1 ^ sin 6, cos 9, — e sin9 eTl\. 

V2 V2 

Up to (overall) normalization, these are the usual degree 1 spherical harmonics Y„ t , 
— 1 < m < 1. The 1 — 2 case is treated in Exercise 2.5 below. Spherical harmon- 
ics are discussed further throughout this text; for a complete discussion, see Stern- 
berg [16]. 


Exercise 2.1 Verify (2.5). 

Exercise 2.2 Consider the functions 


\(,x + iy) 2 , z(x + iy), -^(x 2 + y 2 ~ 2z 2 ),z(x - iy), ^(x - iy) 2 e 


(2.6) 


Verify that they are in fact harmonic, and then write them in spherical coordinates and 
restrict to the unit sphere to obtain, up to normalization, the familiar degree 2 spherical 
harmonics , —2 < m < 2. 


Non-example GL(n, R), invertible n x n matrices 

The ‘general linear group’ GL(n, R), defined to be the subset of M n (R) consisting 
of invertible n x n matrices, is not a vector space though it seems like it could be. 
Why not? 


2.2 Span, Linear Independence, and Bases 

The notion of a basis is probably familiar to most readers, at least intuitively: it is 
a set of vectors out of which we can ‘make’ all the other vectors in a given vector 
space V. In this section we will make this idea precise and describe bases for some 
of the examples in the previous section. 

First, we need the notion of the span of a set of vectors. If S = {i>i , V 2 , ■ ■ ■ , Vk} C 
V is a set of k vectors in V, then the span of S, denoted Span] u i , i> 2 , ■ ■ ■ , } or 

Span S, is defined to be just the set of all vectors of the form c 1 i>i +c 2 U 2 H \-c k Vk- 

Such vectors are known as linear combinations of the p , so Span S is just the set of 
all linear combinations of the vectors in S. For instance, if S — {(1, 0, 0), (0, 1, 0)} C 
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R 3 , then SpanS is just the set of all vectors of the form (c 1 , c 2 , 0) with c 1 , c 2 e R. If 
S has infinitely many elements then the span of S is again all the linear combinations 
of vectors in S, though in this case the linear combinations can have an arbitrarily 
large (but finite) number of terms. 6 

Next we need the notion of linear dependence: a (not necessarily finite) set of vec- 
tors S is said to be linearly dependent if there exists distinct vectors v\, vj, ■ ■ . , v m 
in S and scalars c 1 , c 2 , . . . , c m , not all of which are 0, such that 

c l v\ +c 2 i >2 H f c m v m = 0. (2.7) 

What this definition really means is that at least one vector in S can be written as a 
linear combination of the others, and in that sense is dependent (you should take a 
second to convince yourself of this). If S is not linearly dependent then we say it is 
linearly independent, and in this case no vector in S can be written as a linear com- 
bination of any others. For instance, the set S — {(1, 0, 0), (0, 1, 0), (1, 1, 0)} C R 3 
is linearly dependent whereas the set S' = {(1, 0, 0), (0, 1, 0), (0, 1, 1)} is linearly 
independent, as you can check. 

With these definitions in place we can now define a basis for a vector space V 
as an ordered linearly independent set B C V whose span is all of V . This means, 
roughly speaking, that a basis has enough vectors to make all the others, but no 
more than that. When we say that B — {iq, . . . , i;* } is an ordered set we mean that 
the order of the u, is part of the definition of B, so another basis with the same 
vectors but a different order is considered inequivalent. The reasons for this will 
become clear as we progress. 

One can show 7 that all finite bases must have the same number of elements, so 
we define the dimension of a vector space V, denoted dim V, to be the number of 
elements of any finite basis. If no finite basis exists, then we say that V is infinite- 
dimensional. 

Also, we should mention that basis vectors are often denoted <?,- rather than i>; , 
and we will use this notation from now on. 

Exercise 2.3 Given a vector v and a finite basis B = {e;};= t,...,n, show that the expression 

of v as a linear combination of the e; is unique. 

Example 2.7 R" and C" 

R" has the following natural basis, also known as the standard basis: 

{(1,0 0), (0, 1 0) (0 1)}. 

You should check that this is indeed a basis, and thus that the dimension of R" 
is, unsurprisingly, n. The same set serves as a basis for C", where of course 


6 We do not generally consider infinite linear combinations like XlSi c> v i = limjv->oo c 1 u; 
because in that case we would need to consider whether the limit exists, i.e. whether the sum 
converges in some sense. More on this later. 

7 See Hoffman and Kunze [10]. 
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now the linear combination coefficients c l are allowed to be complex num- 
bers. Note that although this basis is the most natural, there are infinitely many 
other perfectly respectable bases out there; you should check, for instance, that 
{(1, 1, 0, . . . , 0), (0, 1, 1, 0, . . . , 0), . . . , (0, . . . , 1, 1), (1,0,..., 0, 1)} is also a basis 
when n > 2. 

Example 2.8 M n (R) and M n (C) 

Let Ejj be the nxn matrix with a 1 in the z th row, /th column and zeros everywhere 
else. Then you can check that {£;/}/ is a basis for both M„(R) and M„(C), 
and that both spaces have dimension n 2 . Again, there are other nice bases out there; 
for instance, the symmetric matrices Si j = Ejj + Ejj, i < j, and antisymmetric 
matrices A,j = Ejj — Ejj, i < j taken together also form a basis for both M„(R) 
and M , , (C). 

Exercise 2.4 Let S„ (R) , A n (R) be the sets of nxn symmetric and antisymmetric matrices, 
respectively. Show that both are real vector spaces, compute their dimensions, and check 
that dim S n (R) + dim A n (R) = dim M n (R), as expected. 

Example 2.9 H 2 (C) 

Let us find a basis for H 2 (C). First, we need to know what a general element of 
H 2 (C ) looks like. In terms of complex components, the condition A = A' reads 

(: ;)-(i j) 

where the bar denotes complex conjugation. This means that (t.rfel and b — c, so 
in terms of real numbers we can write a general element of H 2 (C ) as 

(xtiy X t-‘z) =tl + X(Tx + yay + Z(Tz (Z9) 

where I is the identity matrix and <j x , a y , a z are the Pauli matrices defined in (2.1). 
You can easily check that the set B — {I, a x , ct v , < t -} is linearly independent, and 
since (2.9) shows that B spans H 2 (C), B is a basis for H 2 (C). We also see that 
H 2 (<C) has (real) dimension 4. 

Exercise 2.5 Using the matrices Sjj and Ajj from Example 2.8, construct a basis for H n (C) 
and compute its dimension. 


Example 2.10 Yj n (d, 0) 

We saw in the previous section that the Y l m are elements of Hi, which can be ob- 
tained from Hi (R 3 ) by restricting to the unit sphere. What is more is that the set 
{Ym }-l<m<l is actually a basis for H/ . In the case / = 1 this is clear: we have 
Hi (R 3 ) = Pi(R 3 ) and clearly + iy), Z, ^(x — iy)} is a basis, and restricting 
this basis to the unit sphere gives the / = 1 spherical harmonics. For / > 1 proving 
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our claim requires a little more effort; see Problem 2.2. Another, simpler basis for 
TL\ (R 3 ) would be the cartesian basis {x, y, z}', physicists use the spherical harmonic 
basis because those functions are eigenfunctions of the orbital angular momentum 
operator L z , which on 7Y/(R 3 ) is represented 8 by L z = — yjy)- We shall 

discuss the relationship between the two bases in detail later. 

Not quite example L 2 ([—a, a]) 

From doing 1-D problems in quantum mechanics one already ‘knows’ that the set 

• nnx r* 

{e “ Inez is a basis for L ([— a, a]). There is a problem, however; we are used to 
taking infinite linear combinations of these basis vectors, but our definition above 
only allows for finite linear combinations. What is going on here? It turns out that 
L 2 ([— a, a]) has more structure than your average vector space: it is an infinite- 
dimensional Hilbert space, and for such spaces we have a generalized definition 
of a basis, one that allows for infinite linear combinations. We will discuss Flilbert 
spaces in Sect. 2.6 


2.3 Components 


One of the most useful things about introducing a basis for a vector space is that 
it allows us to write elements of the vector space as n-tuples, in the form of either 
column or row vectors, as follows: Given v e V and a basis B — {e; },- = i „ for V, 

we can write 


n 

v = ^ v l ei 

i=i 

for some numbers v' , called the components of v with respect to B. We can then 
represent v by the column vector, denoted [i>]g, as 


or the row vector 


Mb = 


/V\ 

,2 


w 


[v]^ = (v\v 2 ,...,v n ) 

where the superscript T denotes the usual transpose of a vector. The subscript B 
just reminds us which basis the components are referred to, and will be dropped if 
there is no ambiguity. With a choice of basis, then, every n -dimensional vector space 


8 As mentioned in the preface, the ft, which would normally appear in this expression, has been set 
to 1. 
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can be made to ‘look like’ R" or C" . Writing vectors in this way greatly facilitates 
computation, as we will see. One must keep in mind, however, that vectors exist 
independently of any chosen basis, and that their expressions as row or column 
vectors depend very much on the choice of basis B. 

To make this last point explicit, let us consider two bases for R 3 : the standard 
basis 


e\ = (1,0,0) 
e 2 = (0, 1,0) 
e 3 = (0, 0, 1) 

and the alternate basis we introduced in Example 2.7 

«i = (1,1,0) 

4 = (o,i,i) 

4 = (i,o,i). 

We will refer to these bases as B and B' , respectively. Let us consider the compo- 
nents of the vector e\ in both bases. If we expand in the basis B, we have 

4 = 1 • e\ + 1 • e 2 + 0 • e 3 


so 


[ e i 1 b ~~ 1 


( 2 . 10 ) 


as expected. If we expand e\ in the basis B' , however, we get 

4 = i ■ 4 + o • 4 + o • 4 

and so 

[ e i \b' = 

which is of course not the same as (2. 10). 9 For fun, we can also express the standard 
basis vector e\ in the alternate basis B'\ you can check that 


1 , 

ei = 2 e > 


1 , 1 , 

2 e2+ 2' 3 


and so 


9 The simple form of [e'Jg' is no accident; you can easily check that if you express any set of basis 
vectors in the basis that they define, the resulting column vectors will just look like the standard 
basis. 
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[ei] B' = 



The following examples will explore the notion of components in a few other 
contexts. 

Example 2.11 Rigid body motion 

One area of physics where the distinction between a vector and its expression as an 
ordered triple is crucial is rigid body motion. In this setting our vector space is R 3 
and we usually deal with two bases, an arbitrary but fixed space axes K' = {x\ y. /'] 
and a time-dependent body axes K — {x(7 ), y(r), '/At)) which is fixed relative to the 
rigid body. These are illustrated in Fig. 2.1. When we write down vectors in R 3 , like 
the angular momentum vector L or the angular velocity vector w, we must keep in 
mind what basis we are using, as the component expressions will differ drastically 
depending on the choice of basis. For example, if there is no external torque on a 
rigid body, [L]^ 1 will be constant whereas [L]k will in general be time-dependent. 

Example 2.12 Different bases for C 2 

As mentioned in Example 2.4, the vector space for a spin 1 /2 particle is identifiable 
with C 2 and the angular momentum operators are given by L, = lo, . In particular, 
this means that 



Fig. 2.1 Depiction of the fixed space axes K ' and the time-dependent body axes K , in gray. K is 
attached to the rigid body and its basis vectors will change in time as the rigid body rotates 


20 


2 Vector Spaces 



and so the standard basis vectors e\, ei are eigenvectors of L z with eigenvalues of 
1 /2 and — 1 /2, respectively. Let us say, however, that we were interested in measur- 
ing L x , where 



Then we would need to use a basis B' of L x eigenvectors, which you can check are 
given by 



If we are given the state e\ and are asked to find the probability of measuring 
L x — 1/2, then we need to expand e\ in the basis B' , which gives 


and so 


1 , , 1 , 
c,= vf ,+ vr 2 


[<?t ]& = 



This, of course, tells us that we have a probability of 1 /2 for measuring L x — +\/2 
(and the same for L x = —1/2). Hopefully this convinces you that the distinction 
between a vector and its component representation is not just pedantic: it can be of 
real physical importance! 


Example 2.13 L 2 ([— a, a]) 


We know from experience in quantum mechanics that all square integrable functions 
on an interval [—a, a ] have an expansion 10 

OO 

/= 2 ^ Cme “ 
m=—oo 

in terms of the ‘basis’ {exp(/^^)} m€ z. This expansion is known as the Fourier 
series of /, and we see that the c m , commonly known as the Fourier coefficients, 

• mitx 

are nothing but the components of the vector / in the basis { e a } m eZ- 


10 This fact is proved in most real analysis books, see Rudin [13]. 
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One of the basic notions in linear algebra, fundamental in quantum mechanics, is 
that of a linear operator. A linear operator on a vector space V is a function T from 
V to itself satisfying the linearity condition 


T (cv + w) = cT ( v ) + T ( w ). 

Sometimes we write T v instead of T (v ). You should check that the set of all linear 
operators on V, with the obvious definitions of addition and scalar multiplication, 
forms a vector space, denoted CiV). You have doubtless seen many examples of 
linear operators: for instance, we can interpret a real n x n matrix as a linear operator 
on R" that acts on column vectors by matrix multiplication. Thus M n (R) (and, 
similarly, M„ (C)) can be viewed as vector spaces whose elements are themselves 
linear operators. In fact, that was exactly how we interpreted the vector subspace 
Hi(C) C M 2 (C) in Example 2.4; in that case, we identified elements of // 2 (C) as 
the quantum-mechanical angular momentum operators. There are numerous other 
examples of quantum-mechanical linear operators: for instance, the familiar position 
and momentum operators x and p that act on L 2 ([—a, a]) by 


(xf)(x) =xf(x) 

( pf)(x ) = -i^f- {x) 

OX 

as well as the angular momentum operators L x , L y , and L z which act on P/(R 3 ) by 


L x (f) = ~i 




L y (f ) = ~i 




L z (f) = ~i 




( 2 . 11 ) 


Another class of less familiar examples is given below. 


Example 2.14 £(V) acting on C{V) 


We are familiar with linear operators taking vectors into vectors, but they can also be 
used to take linear operators into linear operators, as follows: Given A, B e CiV ), 
we can define a linear operator ad,\ e CiCiV )) acting on B by 

ad a (B) = [A,B] 

where [•,•] indicates the commutator. This action of A on C( V ) is called the adjoint 
action or adjoint representation. The adjoint representation has important applica- 
tions in quantum mechanics; for instance, the Heisenberg picture emphasizes C(V) 
rather than V and interprets the Hamiltonian as an operator in the adjoint represen- 
tation. In fact, for any observable A the Heisenberg equation of motion reads 
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— = iad H (A). (2.12) 

at 

□ 

One important property of a linear operator T is whether or not it is invertible, 
i.e. whether there exists a linear operator T~ l such that TT~ 1 = T 1 T = / (where 
/ is the identity operator). 1 1 You may recall that, in general, an inverse for a map 
F between two arbitrary sets A and B (not necessarily vector spaces!) exists if and 
only if F is both one-to-one , meaning 


F(a x ) = F (fl 2 ) 


a\ — 02 Vfl i, «2 £ A, 


and onto, meaning that 


Wb e B there exists a e A such that F (a) = b. 

If this is unfamiliar you should take a moment to convince yourself of this. In the 
particular case of a linear operator T on a vector space V (so that now instead of 
considering F : A — »■ B we are considering T : V —> V , where T has the additional 
property of being linear), these two conditions are actually equivalent. Furthermore, 
they turn out also (see Exercise 2.7 below) to be equivalent to the statement 


T(v) = 0 => u = 0, 

so T is invertible if and only if the only vector it sends to 0 is the zero vector. 

The following exercises will help you parse these notions. 

Exercise 2.6 Suppose V is finite-dimensional and let T G C(V). Show that T being one- 
to-one is equivalent to T being onto. Feel free to introduce a basis to assist you in the proof. 

Exercise 2.7 Suppose T (u) = 0 => v = 0. Show that this is equivalent to T being one- 
to-one, which by the previous exercise is equivalent to T being one-to-one and onto, which 
is then equivalent to T being invertible. 

An important point to keep in mind is that a linear operator is not the same 
thing as a matrix; just as with vectors, the identification can only be made once a 
basis is chosen. For operators on finite-dimensional spaces this is done as follows: 
choose a basis B = {e,- };=i,. Then the action of T is determined by its action on 
the basis vectors: 


( n \ n n 

Jet ) = V ‘ T (««■) = E viT i je J (2- 13 ) 

1=1 / 1=1 i,y=l 

where the numbers 7', 7 , again called the components of T with respect to B. 11 are 
defined by 


"Throughout this text I will denote the identity operator or identity matrix; it will be clear from 
context which is meant. 

^Nomenclature to be justified in the next chapter. 
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T(ei) = J2 T i J ej- 
j = 1 

We then have 


(2.14) 



(v'\ 


/EJUf'Vx 


V 2 

and [T(v)] b = 

EJU v ‘ T i 2 

Ke? = 



\v n ) 


vl;u vi Ti n ) 


which looks suspiciously like matrix multiplication. In fact, we can define the ma- 
trix of T in the basis B, denoted [T]j$, by the matrix equation 


[T( v )\)3 = [T]b[v]b 


where the product on the right-hand side is given by the usual matrix multiplication. 
Comparison of the last two equations then shows that 


[T] B = 


/ T \ 1 
T\ 2 


T 2 l 

t 2 2 



\Tf T 2 n 


T n n J 


(2.15) 


Thus, we really can use the components of T to represent it as a matrix, and once we 
do so the action of T becomes just matrix multiplication by [T]gl Furthermore, if 
we have two linear operators A and B and we define their product (or composition) 
AB as the linear operator 


{AB){v) = A[B{ v)), 

you can then show that [AB] = [A] [5]. Thus, composition of operators becomes 
matrix multiplication of the corresponding matrices. 

Exercise 2.8 For two linear operators A and B on a vector space V, show that [ AB ] = 

[A] [2? ] in any basis. 


Example 2.15 L z , Tii (R 3 ) and spherical harmonics 


Recall that H\ (R 3 ) is the set of all linear functions on R 3 and that {rY^}_ i< m <i = 
{-^(x + iy), z, -^(x — iy)} and {x, y, z} are both bases for this space. 13 Now con- 
sider the familiar angular momentum operator L z = —i(x-^ — yj^) on this space. 


You can check that 


13 We have again ignored the overall normalization of the spherical harmonics to avoid unnecessary 
clutter. 
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L z (x + iy ) = x + iy 
L z (z) = 0 
- oO = - O - iy) 

which implies by (2.14) that the components of L z in the spherical harmonic basis 
are 

(L z ) i' = l 
(L z )\ 2 — (i;)l 3 = 0 

(L z ) 2 ‘ = 0 Vi 
(ij)3 3 = -l 
(^2)3! = (^z ) 3 2 = 0 . 

Thus in the spherical harmonic basis, 

(l 0 0\ 

[LJ {ry t ) = 0 0 0. 

\0 0 -1 ) 

This of course just says that the wavefunctions x + iy, z and x — iy have L z eigen- 
values of 1 , 0, and — 1 , respectively. Meanwhile, 

L z (x) = iy 

L z (y) = -ix 

L z (z) = 0 

so in the cartesian basis, 

/0 -i 0\ 

[ij( w) =h 0 0 , (2.16) 

\0 0 0/ 

a very differently looking matrix. Though the cartesian form of L z is not usu- 
ally used in physics texts, it has some very important mathematical properties, as 
we will see in Part II of this book. For a preview of these properties, see Prob- 
lem 2.3. 

Exercise 2.9 Compute the matrices of L x = — f'Cygj — an d = — * (z gj — x jp) 

acting on H i (R 3 ) in both the cartesian and spherical harmonic bases. 

Before concluding this section we should remark that there is much more one can 
say about linear operators, particularly concerning eigenvectors, eigenvalues and 
diagonalization. Though these topics are relevant for physics, we will not need them 
in this text and good references for them abound, so we omit them. The interested 
reader can consult the first chapter of Sakurai [14] for a practical introduction, or 
Hoffman and Kunze [10] for a thorough discussion. 
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Another basic construction associated with a vector space, essential for understand- 
ing tensors and usually left out of the typical ‘mathematical methods for physicists’ 
courses, is that of a dual vector. Roughly speaking, a dual vector is an object that 
eats a vector and spits out a number. This may not sound like a very natural opera- 
tion, but it turns out that this notion is what underlies bras in quantum mechanics, 
as well as the raising and lowering of indices in relativity. We will explore these 
applications in Sect. 2.7. 

Now we give the precise definitions. Given a vector space V with scalars C, 
a dual vector (or linear functional ) on V is a C-valued linear function / on V, 
where ‘linear’ again means 


f(c v + w) = cf(v) + f(w). 


(2.17) 


(Note that f{v) and f(w) are scalars, so that on the left-hand side of (2.17) the 
addition takes place in V , whereas on the right side it takes place in C.) The set of 
all dual vectors on V is called the dual space of V, and is denoted V*. It is easily 
checked that the usual definitions of addition and scalar multiplication and the zero 
function turn V* into a vector space over C. 

The most basic examples of dual vectors are the following: let {e,} be a (not 
necessarily finite) basis for V, so that an arbitrary vector v can be written as v = 
Ya= 1 n'e, . Then for each i, we can define a dual vector e' by 


e'(v) = v l . 


(2.18) 


This just says that <?' ‘picks off’ the ith component of the vector v. Note that in order 
for this to make sense, a basis has to be specified. 

A key property of dual vectors is that a dual vector / is entirely determined by 
its values on basis vectors: by linearity we have 


m = f 


J2 v ‘ e ‘ 

i=i 


= Y^v‘ f{et) 
i = 1 


i=i 


where in the last line we have defined 


(2.19) 


fi = f(ei) 

which we unsurprisingly refer to as the components of / in the basis { e, } . To justify 
this nomenclature, notice that the e‘ defined in (2.18) satisfy 

e i (e i ) = 8 i j 


(2.20) 
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where S', is the usual Kronecker delta. If V is finite-dimensional with dimension n, it 
is then easy to check (by evaluating both sides on basis vectors) that we can write 14 


n 

/=!>*'' 

1=1 

so that the // really are the components of /. Since / was arbitrary, this means that 
the e‘ span V* . In Exercise 2.10 below you will show that the e l are actually linearly 
independent, so {e‘ } i= i „ is actually a basis for V* . We sometimes say that the e ' 
are dual to the e,. Note that we have shown that V and V* always have the same 
dimension. We can use the dual basis {e‘ } = B* to write / in components, 


[/Is* = 


(fx\ 

h 


\fn ) 


and in terms of the row vector [/]g* we can write (2.19) as 


/W = [/]g.MB = [/] r M 

where in the last equality we dropped the subscripts indicating the bases. Again, we 
allow ourselves to do this whenever there is no ambiguity about which basis for V 
we are using, and in all such cases we assume that the basis being used for V* is 
just the one dual to the basis for V . 

Finally, since e‘ ( v ) = v‘ we note that we can alternatively think of the /th com- 
ponent of a vector as the value of e‘ on that vector. This duality of viewpoint will 
crop up repeatedly in the rest of the text. 

Exercise 2.10 By carefully working with the definitions, show that the e 1 defined in (2.18) 
and satisfying (2.20) are linearly independent. 


Example 2.16 Dual spaces of R" , C" , M n (R) and M n (C) 

Consider the basis {e,} of R" and C", where e ; is the vector with a 1 in the /th 
place and 0’s everywhere else; this is just the basis described in Example 2.7. Now 
consider the element f J of V* which eats a vector in R" or C" and spits out the /th 
component; clearly /■ / (e,) = Sj so the f } are just the dual vectors e J described 
above. Similarly, for M„(R) or M„(C) consider the dual vector f ,J defined by 
f lJ (A) = Aij\ these vectors are clearly dual to the Ejj and thus form the corre- 
sponding dual basis. While the f' J may seem a little unnatural or artificial, you 
should note that there is one linear functional on M n (R) and M n (C) which is famil- 
iar: the trace functional, denoted Tr and defined by 


14 If V is infinite-dimensional then this may not work as the sum required may be infinite, and as 
mentioned before care must be taken in defining infinite linear combinations. 
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n 

Tr (A) = J2 Ai ‘- 

f=l 

What are the components of Tr with respect to the f' J ? 


2.6 Non-degenerate Hermitian Forms 

Non-degenerate Hermitian forms, of which the Euclidean dot product, Minkowski 
metric and Hermitian scalar product of quantum mechanics are but a few examples, 
are very familiar to most physicists. We introduce them here not just to formalize 
their definition but also to make the fundamental but usually unacknowledged con- 
nection between these objects and dual spaces. 

A non-degenerate Hermitian form on a vector space V is a C-valued function 
(• | •) which assigns to an ordered pair of vectors v,weV a scalar, denoted (u|w), 
having the following properties: 

1. (u| u>i + cwj) = (u|wi) + c(u|u; 2 ) (linearity in the second argument) 

2. (i>|w) = (ui|i>) ( Hermiticity ; the bar denotes complex conjugation) 

3. For each w/Oe V, there exists w e V such that (v\iv) f- 0 (non-degeneracy) 

Note that conditions 1 and 2 imply that (cu|w) = c(v\w), so (• | ■) is conjugate- 
linear in the first argument. Also note that for a real vector space, condition 2 implies 
that (• | •) is symmetric, i.e. (v|u;) = (w|u) 15 ; in this case, (• | •) is called a metric. 
Condition 3 is a little nonintuitive but will be essential in the connection with dual 
spaces. If, in addition to the above 3 conditions, the Hermitian form obeys 

4. (u|d) > 0 for all v e V, v ^ 0 (positive-definiteness) 

then we say that (• | •) is an inner product, and a vector space with such a Hermi- 
tian form is called an inner product space. In this case we can think of (u|i>) as 
the ‘length squared’ of the vector v, and the notation ||u|| = v^rju) is sometimes 
used. Note that condition 4 implies 3. Our reason for separating condition 4 from 
the rest of the definition will become clear when we consider the examples. One 
very important use of non-degenerate Hermitian forms is to define preferred sets 
of bases known as orthonormal bases. Such bases B = {e;} by definition satisfy 
(e, \ej) — ±8ij and are extremely useful for computation, and ubiquitous in physics 
for that reason. If (• | •) is positive-definite (hence an inner product), then orthonor- 
mal basis vectors satisfy (e; \e ,) = <5,-; and may be constructed out of arbitrary bases 
by the Gram-Schmidt process. If (• | ■) is not positive-definite then orthonormal 
bases may still be constructed out of arbitrary bases, though the process is slightly 
more involved. See Hoffman and Kunze [ 10], Sects. 8.2 and 10.2 for details. 


15 In this case, (■ | •) is linear in the first argument as well as the second and would be referred to as 
bilinear. 
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Exercise 2.11 Let (• | •) be an inner product. If a set of non-zero vectors e \ . . . . , is 
orthogonal, i.e. (e,\ ej) = 0 when i ^ j, show that they are linearly independent. Note 
that an orthonormal set (i.e. (e; \ej) = ±5y) is just an orthogonal set in which the vectors 
have unit length. 

Example 2.17 The dot product (or Euclidean metric) on R" 

Let v = (v 1 , . . . , v n ), w — (w 1 , . . . , w n ) e R". Define (• | ■) on R" by 

n 

(n|w) = y~Vit/. 
i=l 

This is sometimes written as v ■ w. You should check that (• | •) is an inner product, 
and that the standard basis given in Example 2.7 is an orthonormal basis. 

Example 2.18 The Hermitian scalar product on C" 

Let v = (u 1 , . .. , v' 1 ), w — (w 1 , . . . , w n ) e C". Define (• | ■) on C" by 

n 

(n|w) = (2.21) 

i=l 

Again, you can check that (• | •) is an inner product, and that the standard basis given 
in Example 2.7 is an orthonormal basis. Such inner products on complex vector 
spaces are sometimes referred to as Hermitian scalar products and are present on 
every quantum-mechanical vector space. In this example we see the importance of 
condition 2, manifested in the conjugation of the v 1 in (2.21); if that conjugation was 
not there, a vector like v = (i, 0, . . . , 0) would have (u|u) = — 1 and (• | •) would not 
be an inner product. 

Exercise 2.12 Let A, B e M„(C). Define (■ | •) on M„(C) by 

( J 4|B) = ^Tr(A t fi). (2.22) 

Check that this is indeed an inner product. Also check that the basis [I,o x ,a y ,a z ] for 
HiiC) is orthonormal with respect to this inner product. 

Example 2.19 The Minkowski metric on 4-D spacetime 

Consider two vectors 16 Vj = (x; , y,-, z,-, r, ) e R , i = 1,2. The Minkowski metric, 
denoted rj, is defined to be 17 

r)(vi, v 2 ) = x\x 2 + yi.V 2 + zizi - t\t 2 . (2.23) 


16 These are often called ‘events’ in the physics literature. 

17 We are, of course arbitrarily choosing the + + H — signature; we could equally well choose 
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ri is clearly linear in both its arguments (i.e. it is bilinear) and symmetric, hence 
satisfies conditions 1 and 2, and you will check condition 3 in Exercise 2.13 below. 
Notice that for v — (1, 0, 0, 1), r](v, v) — 0 so ij is not positive-definite, hence not 
an inner product. This is why we separated condition 4, and considered the more 
general non-degenerate Hermitian forms instead of just inner products. 


Exercise 2.13 Let v = (x, y, z, t) be an arbitrary non-zero vector in R 4 . Show that r/ is 
non-degenerate by finding another vector w such that r/(v, w) f 0. 


We should point out here that the Minkowski metric can be written in components 
as a matrix, just as a linear operator can. Taking the standard basis B — { cy } / = i , 2 , 3,4 
in 1R 4 , we can define the components oft), denoted rjjj, as 


iyj = r]{ei,ej). 


(2.24) 


Then, just as was done for linear operators, you can check that if we define the 
matrix of rj in the basis B, denoted [t?]g, as the matrix 


[n]B = 


then we can write 


f mi 

mi 

731 

741^ 


(1 

0 

0 

0 ^ 


m 2 

m 2 

732 

742 


0 

1 

0 

0 


m 3 

723 

733 

743 


0 

0 

1 

0 


\V 14 

724 

734 

744/ 


V° 

0 

0 

-v 







n 

0 

0 

0 \ 

^*0 

2 f[ri][v l ] = 

(X2, yi, Z2,t2) 

0 

0 

1 

0 

0 

1 

0 

0 

yi 

Zl 






v0 

0 

0 

-v 

w 


(2.25) 


(2.26) 


as some readers may be used to from computations in relativity. Note that the sym- 
metry of j] implies that [rj]jg is a symmetric matrix for any basis B. 


Example 2.20 The Hermitian scalar product on if ([—a. a]) 


For /, g e L 2 ([—a, a]), define 

(.f\8) = ^~ [“ fgdx. (2.27) 

2n J-a 

You can easily check that this defines an inner product on L 2 ([— a, a]), and that 

• nnx 

{e a } neZ is an orthonormal set. What is more, this inner product turns L ([— a, a]) 
into a Hilbert Space , which is an inner product space that is complete. The notion of 
completeness is a technical one, so we will not give its precise definition, but in the 
case of Z, 2 ([— a, a]) one can think of it as meaning roughly that a limit of a sequence 
of square-integrable functions is again square-integrable. Making this precise and 
proving it for L 2 ([— a, a]) is the subject of real analysis textbooks and far outside 
the scope of this text. 18 We will thus content ourselves here with just mentioning 


18 See Rudin [13], for instance, for this and for proofs of all the claims made in this example. 
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completeness and noting that it is responsible for many of the nice features of Hilbert 
spaces, in particular the generalized notion of a basis which we now describe. 

Given a Hilbert space 7 i. and an orthonormal (and possibly infinite) set \e, ) C H, 
the set { e,- } is said to be an orthonormal basis for Ti if 

( e i I /) = 0 V/ =► / = 0. (2.28) 

You can check (see Exercise 2.14 below) that in the finite-dimensional case this 
definition is equivalent to our previous definition of an orthonormal basis. In the 
infinite-dimensional case, however, this definition differs substantially from the old 
one in that we no longer require Spanje,- } = 'H (recall that spans only include finite 
linear combinations). Does this mean, though, that we now allow arbitrary infinite 
combinations of the basis vectors? If not, which ones are allowed? For Lr{[— a, a]), 

• mix 

for which {e ° } n eZ is an orthonormal basis, we mentioned in Example 2.13 that 
any / e L 2 ([— a, a]) can be written as 


where 


OO 

/= E 


• mix 

c n e « 


n=—o o 


2 ra 00 

2ah f,2dX = £ |C "' 


(2.29) 


(2.30) 


(The first equality in (2.30) should be familiar from quantum mechanics and follows 
from Exercise 2.15 below.) The converse to this is also true, and this is where the 
completeness of L 2 ([— a, a]) is essential: if a set of numbers c„ satisfy (2.30), then 
the series 


OO 

E ; mix 

C n e « (2.31) 

n=—oo 

converges, yielding a square-integrable function g. So L 2 ([— a,a\) is the set of all 
expressions of the form (2.29), subject to the condition (2.30). Now we know how to 
think about infinite-dimensional Hilbert spaces and their bases: a basis for a Hilbert 
space is an infinite set whose infinite linear combinations, together with some suit- 
able convergence condition, form the entire vector space. 

Exercise 2.14 Show that the definition (2.28) of a Hilbert space basis is equivalent to our 
original definition of a basis for a finite - dimensional inner product space V . 

Exercise 2.15 Show that for / = E^l-oo c « e ' > S = E“=-oo d n< e ‘ e L 2 ([-a, ci]), 

OO 

(f\g)= Y, i 2 - 32 ) 

n=—o o 

so that (■ | •) on L 2 ([— a, a]) can be viewed as the infinite-dimensional version of (2.21), 
the standard Hermitian scalar product on C" . 
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2.7 Non-degenerate Hermitian Forms and Dual Spaces 


We are now ready to explore the connection between dual vectors and non- 
degenerate Hermitian forms. Given a non-degenerate Hermitian form (• | •) on a 
finite-dimensional vector space V, we can associate to any v e V a dual vector 
v e V* defined by 


v(w ) = (u|u;). 

This defines a very important map, 


(2.33) 


L :V -> V* 
vh- 5 

which will crop up repeatedly in this chapter and the next. We will sometimes write 
v as L(v) or ( jj | - ) , and refer to it as the metric dual of u. 19 
Now, L is conjugate-linear since for v = cx + z, v, z, x e V , 


v(w) = ( v\w ) = (cx + z\u>) = c(x\w) + ( z\w ) = cx(w) + z(w) (2.34) 


so 


v = L(cx + z) — cx + z = cL(x) + L(z). (2.35) 

In Exercise 2.16 below you will show that the non-degeneracy of (• | •) implies that 
L is one-to-one and onto, so L is an invertible map from V to V*. This allows us to 
identify V with V *, a fact we will discuss further below. 

Before we get to some examples, it is worth pointing out one potential point of 
confusion: if we have a basis {e,} i= i. „ and corresponding dual basis {e'} ;= | „, 

it is not necessarily true that a dual vector e' in the dual basis is the same 
as the metric dual Lie,)', in particular, the dual basis vector e‘ is defined rela- 
tive to a whole basis {<?, }, = i „, whereas the metric dual L(e,-) only depends on 

what e; is, and does not care if we change the other basis vectors. Furthermore, 
L(gj) depends on your non-degenerate Hermitian form (that is why we call it a met- 
ric dual), whereas e 1 does not (remember that we introduced dual basis vectors in 
Sect. 2.5, before we even knew what non-degenerate Hermitian forms were!). You 
may wonder if there are special circumstances when we do have e' = T(e,); this is 
Exercise 2.17. 

With that caveat let us proceed to some examples, where you will see that you 
are already familiar with L from a couple of different contexts. 

Exercise 2.16 Use the non-degeneracy of (• | •) to show that L is one-to-one, i.e. that 
L{v) = L(ui ) =$■ v = w. Combine this with the argument used in Exercise 2.7 to show 
that L is onto as well. 

Exercise 2.17 Given a basis {e;};=l, ....«> under what circumstances do we have e l =e, for 
all (? 


19 The ■ in the notation (u|-) signifies the slot into which a vector w is to be inserted, yielding the 
number (u|ui). 
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Example 2.21 Bras and kets in quantum mechanics 

Let H be a quantum-mechanical Hilbert space with inner product (■ | •)• In Dirac 
notation, a vector i/r e H is written as a ket | t/r) and the inner product {\jf, (p) is writ- 
ten (i/f\<p). What about bras, written as (t/r |? What, exactly, are they? Most quantum 
mechanics texts gloss over their definition, just telling us that they are in 1-1 cor- 
respondence with kets and can be combined with kets as (i?/ 1(/>) to get a scalar. We 
are also told that the correspondence between bras and kets is conjugate-linear, i.e. 
that the bra corresponding to c|t/r) is c( i/'j- From what we have seen in this section, 
it is now clear that bras really are dual vectors, labeled in the same way as regular 
vectors, because the map L allows us to identify the two. In short, (t/r | is really just 
L (i/t), or equivalently (i/'j - )- 

Example 2.22 Raising and lowering indices in relativity 

Consider R 4 with the Minkowski metric, let B — {e At } |Lt= i i ...,4 and B' = 4 

be the standard basis and dual basis for R 4 (we use a Greek index to conform with 
standard physics notation 20 ), and let v = ^ 4 =1 v^e^ e R 4 . What are the compo- 
nents of the dual vector v in terms of the v 11 ? Well, as we saw in Sect. 2.5, the 
components of a dual vector are just given by evaluation on the basis vectors, so 

V/j, = = (v\e^) = Y lAeulGi) = Y (2-36) 

V V 

In matrices, this reads 


[v] B ' = WbMb (2-37) 

so matrix multiplication of a vector by the metric matrix gives the corresponding 
dual vector in the dual basis. Thus, the map L is implemented in coordinates by [q] . 
Now, we mentioned above that L is invertible; what does L~ l look like in coordi- 
nates? Well, by the above, L -1 should be given by matrix multiplication by [*7] , 

the matrix inverse to [q]. Denoting the components of this matrix by q llv (so that 
q rfl q VT = S(f) and writing / = L~ l (/) where / is a dual vector, we have 

[/] = M _1 [/] (2.38) 

or in components 

f» = Yq v ^f v . (2.39) 

V 

The expressions in (2.36) and (2.39) are probably familiar to you. In physics 
one usually works with components of vectors, and in relativity the numbers v 1 ' 


20 As long as we are talking about ‘standard’ physics notation, you should also be aware that in 
many texts the indices run from 0 to 3 instead of 1 to 4, and in that case the zeroth coordinate 
corresponds to time. 
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are called the contravariant components of v and the numbers v t , = Y^ v v v r] Vfl of 
(2.36) are referred to as the covariant components of v. We see now that the con- 
travariant components of a vector are just its usual components, while its covariant 
components are actually the components of the associated dual vector v. For a 
dual vector /, the situation is reversed — the covariant components / ;i are its actual 
components, and the contravariant components are the components of /. Since L 
allows us to turn vectors into dual vectors and vice versa, we usually do not bother 
trying to figure out whether something is ‘really’ a vector or a dual vector; it can be 
either, depending on which components we use. 

The above discussion shows that the familiar process of “raising” and “lowering” 
indices is just the application of the map L (and its inverse) in components. For an 
interpretation of [>;] -1 as the matrix of a metric on R 4 *, see Problem 2.7. 

Exercise 2.18 Consider R 3 with the Euclidean metric. Show that the covariant and con- 
travariant components of a vector in an orthonormal basis are identical. This explains why 
we never bother with this terminology, nor the concept of dual spaces, in basic physics 
where R 3 is the relevant vector space. Is the same true for R 4 with the Minkowski metric? 

Example 2.23 L 2 ([— a, a]) and its dual 

In our above discussion of the map L we stipulated that V should be finite- 
dimensional. Why? If you examine the discussion closely, you will see that the only 
place where we use the finite-dimensionality of V is in showing that L is onto. Does 
this mean that L is not necessarily onto in infinite dimensions? Consider the Dirac 
Delta functional 8 e L 2 ([— a, a])*. Does 


«(*) = g(0) = («(*)!*) 


(2.40) 


- 'QQ ■ nnx 

for some function 8{x)l If we write g as g(x) = 2^„__ 00 d„e “ , then simply eval- 
uating g at x — 0 gives 


OO 



(2.41) 


which, when compared with (2.32), tells us that the function 8(x) must have Fourier 
coefficients c n — 1 for all n. Such c„, however, do not satisfy (2.30) and hence 8 (x ) 
cannot be a square-integrable function. So the dual vector 5 is not in the image of 
the map L, hence L is not onto in the case of L 2 ([— a, a]). 


2.8 Problems 


2.1 Prove that L 2 ([— a, a]) is closed under addition. You will need the trian- 
gle inequality, as well as the following inequality, valid for all lei: 0 < 
Jl a (\f\ + Mg\) 2 dx. 
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2.2 In this problem we show that {> J Yj n } is a basis for 7T/(R 3 ), which implies that 
{ Y] n } is a basis for H/ . We will gloss over a few subtleties here; for a totally rigorous 
discussion see Sternberg [16] or our discussion in Chap. 4. 

(a) Let f e H/(R 3 ), and write / as f — r ] Y(0 , </>). Then we know that 

A s2 Y = -1(1 + 1)7. (2.42) 

If you have never done so, use the expression (2.4) for A S 2 and the expressions 
(2.1 1) for the angular momentum operators to show that 

Agi = L 2 x +L 2 y + Ll = L 2 

so that (2.42) says that Y is an eigenfunction of L 2 , as expected. You will need 
to convert between cartesian and spherical coordinates. The theory of angular 
momentum 21 then tells us that 7Y/(R 3 ) has dimension 2/ + 1. 

(b) Exhibit a basis for Hi (R 3 ) by considering the function fj = (x + iy) 1 and show- 
ing that 



The theory of angular momentum then tells us that (L~) k fl = fj satisfies 
L :.(/[) = (J~ k)fl and that [fj}o<k<2l is a basis for 7Y/(R 3 ). 

(c) Writing fj = r'Yj_ k we see that Y l m satisfies L 2 y/ n = — 1(1 + 1 )Y l m and L z Yj n = 
m Yj n as expected. Now use this definition of Y l m to compute all the spherical 
harmonics for / = 1 , 2 and show that this agrees, up to normalization, with 
the spherical harmonics as tabulated in any quantum mechanics textbook. If 
you read Example 2.6 and did Exercise 2.5 then all you have to do is compute 
fl , 0 < k < 2 and f? , 0 < k < 4 and show that these functions agree with the 
ones given there. 

2.3 In discussions of quantum mechanics you may have heard the phrase “angular 
momentum generates rotations”. What this means is that if one takes a component 
of the angular momentum such as L z and exponentiates it, i.e. if one considers the 
operator 

oo i 

exp(-/0L,) = V —(-i(pL z )" 
z ' n\ 

n = 0 

— I — i<PL z + —(—i(pL z ) 2 + — (— i(pL z ) 3 H 

(the usual power series expansion for e x ) then one gets the operator which represents 
a rotation about the z axis by an angle <j). Confirm this in one instance by explicitly 
summing this power series for the operator [L z ]{ x ,y,z) of Example 2.15 to get 


21 See Sakurai [ 14] or Gasiorowicz [5] or our discussion in Chap. 4. 
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( COS0 

sin <j> 

0 

the usual matrix for a rotation about the z-axis. 

2.4 Let V be finite-dimensional, and consider the 'double dual’ space ( V*)* . De- 
fine a map 

J: V-> (V*)* 
v h J v 

where J v acts on / e V* by 



J v (f) = f(v). 

Show that J is linear, one-to-one, and onto. This means that we can identify (V*)* 
with V itself, and so the process of taking duals repeats itself after two iterations. 
Note that no non-degenerate Hermitian form was involved in the definition of J ! 

2.5 Consider a linear operator A on a vector space V . We can define a linear oper- 
ator on V* called the transpose of A and denoted by A 7 as follows: 

(A r {f)){v) = f(Av) where veV, f eV*. 

If B is a basis for V and £>* the corresponding dual basis, show that 

Thus the transpose of a matrix really has meaning; it is the matrix representation of 
the transpose of the linear operator represented by the original matrix! 

2.6 This problem will explore the notion of the Hermitian adjoint of a linear oper- 
ator. 

(a) Let A be a linear operator on a finite-dimensional, real or complex vector space 
V with inner product (• | •)• Using the transpose A 1 from the previous problem, 
as well as the map L : V — >■ V* defined in Sect. 2.7, we can construct a new 
linear operator A 1 ; this is known as the Hermitian adjoint of A, and is defined 
as 

A t = L~ X o A t oL : V. (2.43) 

Show that A^ satisfies 

(A^uliu) = (v\Aw), (2.44) 

which is the equation that is usually taken as the definition of the adjoint opera- 
tor. The advantage of our definition (2.43) is that it gives an interpretation to A 1 ; 
it is the operator you get by transplanting A T , which originally acts on V*, over 
to V via L . 


36 


2 Vector Spaces 


(b) Show that in an orthonormal basis {e;};= 

[A + ] = [A]*, 

where the dagger outside the brackets denotes the usual conjugate transpose of 
a matrix (if V is a real vector space, then the dagger outside the brackets will 
reduce to just the transpose). You may want to prove and use the fact A/ = 
e'(Aej). 

(c) If A satisfies A — Af A is then said to be self-adjoint or Hermitian. Since A ' 
is defined with respect to an inner product, self-adjointness indicates a certain 
compatibility between A and the inner product. Show that even when V is a 
complex vector space, any eigenvalue of A must be real if A is self-adjoint. 

(d) In part (b) you showed that in an orthonormal basis the matrix of a Hermitian 
operator is a Hermitian matrix. Is this necessarily true in a non-orthonormal 
basis? 

2.7 Let g be a non-degenerate bilinear form on a vector space V (we have in 
mind the Euclidean metric on R 3 or the Minkowski metric on R 4 ). Pick an arbitrary 
(not necessarily orthonormal) basis, let [g] _1 be the matrix inverse of [g] in this 
basis, and write g / ' 1 ' for the components of [g] _1 . Also let /, h e V*. Define a non- 
degenerate bilinear form g on V* by 

g(f, h) = g{f ,h) (2.45) 

where / = L~ l (f) as in Example 2.22. Show that 

g^ = g(^,0 = g^ (2.46) 

so that [g]~ 1 is truly a matrix representation of a non-degenerate bilinear form 
on V*. 

2.8 In this problem we will acquaint ourselves with P (R) , the set of polynomials in 
one variable x with real coefficients. We will also meet several bases for this space 
which you should find familiar. 

(a) P (M) is the set of all functions of the form 

/(.*) = co + cix + cjx 2 H f c n x n (2.47) 

where n is arbitrary. Verify that P(M) is a (real) vector space. Then show that 
P (M) is infinite-dimensional by showing that, for any finite set S C P (M) , there 
is a polynomial that is not in Span S. Exhibit a simple infinite basis for P(&L). 

(b) Compute the matrix corresponding to the operator 4- e C(P(K)) with respect 
to the basis you found in part (a). 

(c) One can turn P (M) into an inner product space by considering inner products 
of the form 

(f\g)= f f(x)g(x)W(x)dx 


(2.48) 
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where W{x) is a nonnegative weight function. One can then take the basis 
B = {1, x, x 2 , x 3 , . . .} and apply the Gram-Schmidt process to get an orthogo- 
nal basis. With the proper choice of range of integration [a, b] and weight func- 
tion W(x), we can obtain (up to normalization) the various orthogonal poly- 
nomials one meets in studying the various differential equations that arise in 
electrostatics and quantum mechanics. 

(i) Let [a, b] — [— 1, 1] and W(x) — 1 . Consider the set S — {1, x, x 2 , x 3 } C B. 
Apply the Gram-Schmidt process to this set to get (up to normalization) 
the first four Legendre Polynomials 

P 0 (x) = 1 
Pi (x) — X 

Pi(x) = \{ 3x 2 -l) 

P 2 (x) — -(5x 3 — 3x). 

The Legendre Polynomials show up in the solutions to the differential 
equation (2.42), where we make the identification x = cos 6. Since — 1 < 

cos 9 < 1, this explains the range of integration in the inner product. 

_ 2 

(ii) Now let [a, b] = (— oo, oo) and W(x) — e x . Use Gram-Schmidt on S to 
get the first four Hermite Polynomials 

H 0 (x) = 1 
H\ (x) = 2x 
H 2 (x)=4x 2 -2 
H 3 (x) = 8jt 3 — 12x. 

These polynomials arise in the solution to the Schrodinger equation for 
a one-dimensional harmonic oscillator. Note that the range of integration 
corresponds to the range of the position variable, as expected. 

(iii) Finally, let [a,b] = (0, oo) and W(x) = e~ x . Again, use Gram-Schmidt 
on S to get the first four Laguerre Polynomials 

Lq(x) = 1 

L i (x) = —x + 1 

L 2 (x) = ^(x 2 - 4x + 2) 

L 3 (x) = g(~ x3 + 9* 2 — 18x + 6). 

These polynomials arise as solutions to the radial part of the Schrodinger 
equation for the hydrogen atom. In this case x is interpreted as a radial 
variable, hence the range of integration (0, oo). 



Chapter 3 

Tensors 


Now that we are familiar with vector spaces we can finally approach our main sub- 
ject, tensors. We will give the modern component-free definition, from which will 
follow the usual transformation laws that used to be the definition. 

From here on out we will employ the Einstein summation convention, which is 
that whenever an index is repeated in an expression, once as a superscript and once 
as a subscript, then summation over that index is implied. Thus an expression like 
v = Yl'i= i v ‘ e i becomes v = v ! e,-. We will comment on this convention in Sect. 3.2. 
Also, in this chapter we will treat mostly finite-dimensional vector spaces and ignore 
the complications and technicalities that arise in the infinite-dimensional case. In the 
few examples where we apply our results to infinite-dimensional spaces, you should 
rest assured that these applications are legitimate (if not explicitly justified), and that 
rigorous justification can be found in the literature. 


3.1 Definition and Examples 

A tensor of type (r, s) on a vector space V is a C-valued function T on 

V x • - • x V x V* x • - • x V* (3.1) 

r times s times 

which is linear in each argument, i.e. 

T (l>i + cw, V2 IV, f\ f s ) 

= T(Vi, . . . , V r , fl, . . . , fs) + cT(w, V2, ■ ■ ■ , /l, ■ ■ • , fs) (3.2) 

and similarly for all the other arguments. This property is called multilinearity . Note 
that dual vectors are (1, 0) tensors, and that vectors can be viewed as (0, 1) tensors: 

v(f) = f(v) where veV, feV*. (3.3) 

Similarly, linear operators can be viewed as (1, 1) tensors as 

A(v,f) = f{Av). (3.4) 


N. Jeevanjee, An Introduction to Tensors and Group Theory for Physicists, 
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We take (0, 0) tensors to be scalars, as a matter of convention. You will show in 
Exercise 3.1 below that the set of all tensors of type (r,s) on a vector space V, 
denoted Tf (V) or just T r , form a vector space. This should not come as much 
of a surprise since we already know that vectors, dual vectors and linear operators 
all form vector spaces. Also, just as linearity implies that dual vectors and linear 
operators are determined by their values on basis vectors, multilinearity implies the 

same thing for general tensors. To see this, let {e, } i= i „ be a basis for V and 

{e'}; = i the corresponding dual basis. Then, denoting the ith component of the 
vector v p as v‘ p and the / th component of the dual vector f q as f q j, we have (by 
repeated application of multilinearity) 

T(vi , ..., v r , fu f s ) = i// ...v l ; ... f sjs T(e h , ...,e ir ,e J1 , ...,e Js ) 

— uj 1 . . . v‘ r r fi jl . . . fs js Ti l ,,,,j r ' l '" is 
where, as before, the numbers 


Th..., 


Jl-Js 


= T(e 


ir ) 


oJr 


(3.5) 


are called the components of T in the basis {e, }, = i „ . You should check that this 

definition of the components of a tensor, when applied to vectors, dual vectors, and 
linear operators, agrees with the definitions given earlier. Also note that (3.5) gives 
us a concrete way to think about the components of tensors: they are the values of 
the tensor on the basis vectors. 


Exercise 3.1 By choosing suitable definitions of addition and scalar multiplication, show 
that T S ' (V) is a vector space. 

If we have a non-degenerate bilinear form on V, then we may change the type 
of T by precomposing with the map L or L~ l . If T is of type (1,1) with com- 
ponents Tj ] , for instance, then we may turn it into a tensor 7 of type (2,0) by 
defining T(v, w) = T ( v . L(w)). This corresponds to lowering the second index, 
and we write the components of T as Tij. omitting the tilde since the fact that we 
lowered the second index implies that we precomposed with L. This is in accord 
with the conventions in relativity, where given a vector tel 4 we write v M for the 
components of v when we should really write C /t . From this point on, if we have 
a non-degenerate bilinear form on a vector space then we permit ourselves to raise 
and lower indices at will and without comment. In such a situation we often do 
not discuss the type of a tensor, speaking instead of its rank, equal to r + s, which 
obviously does not change as we raise and lower indices. 


Example 3.1 Linear operators in quantum mechanics 


Thinking about linear operators as (1, 1) tensors may seem a bit strange, but in fact 
this is what one does in quantum mechanics all the time! Given an operator H on 
a quantum mechanical Hilbert space spanned by orthonormal vectors {e, } (which 
in Dirac notation we would write as { | / ) } ) , we usually write H\i) for 77(e,), {j\i) 
for ej(ei) — (ej\ei), and (j\H\i) for ( ej\Hei ). Thus, (3.4) would tell us that (using 
orthonormal basis vectors instead of arbitrary vectors) 
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Hi j = H(e h e J ) 

= e J (Hei) 

= U\H\i) 

where we converted to Dirac notation in the last equality to obtain the familiar quan- 
tum mechanical expression for the components of a linear operator. These compo- 
nents are often referred to as matrix elements , since when we write operators as 
matrices the elements of the matrices are just the components arranged in a particu- 
lar fashion, as in (2.15). 

Example 3.2 The Levi-Civita tensor 

Consider the (3, 0) Levi-Civita tensor e on R 3 defined by 

c(m, v, w) = (u x v) ■ w, u, v,we R 3 . (3.6) 

You will check below that e really is multilinear, hence a tensor. It is well-known 
from vector calculus that (u x i>) • w is the (oriented) volume of a parallelepiped 
spanned by u, v, and w (see Fig. 3.1), so one can think of the Levi-Civita tensor as 
a kind of ‘volume operator’ which eats three vectors and spits out the volume that 
they span. 

What about the components of the Levi-Civita tensor? If {ei, e 2 , e$} is the stan- 
dard basis for R 3 then (3.6) yields 

f-ijk = e (<h ,ej, e*) 

= (ei x e j) ek 
= €ijk 

where ejjk is the usual Levi-Civita symbol (defined below). Thus the Levi-Civita 
symbol represents the components of an actual tensor, the Levi-Civita tensor! Fur- 
thermore, keeping in mind the interpretation of the Levi-Civita tensor as a volume 
operator, as well as the fact that the components e,y* are just the values of the tensor 
on the basis vectors, then we find that the usual definition of the Levi-Civita symbol. 



Fig. 3.1 The parallelepiped spanned by u, v and w 
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€ ijk '■ 


+ 1 if {/ jk} = {1,2,3}, {2,3,1}, or {3, 1,2} 
-1 if {i jk} = {3,2, 1}, {1,3,2}, or {2, 1,3} 
0 otherwise, 


is just telling us, for instance, that a parallelepiped spanned by {ei, eo, e 3 } has ori- 
ented volume + 1 ! 


Exercise 3.2 Verify that the Levi-Civita tensor as defined by (3.6) really is multilinear. 
Example 3.3 The moment of inertia tensor 

The moment of inertia tensor, denoted X, is the symmetric (2, 0) tensor on R 3 which, 
when evaluated on the angular velocity vector, yields the kinetic energy of a rigid 
body, i.e. 

1 

-l(o),a>) = KE. (3.7) 

Alternatively we can raise an index on X and define it to be the linear operator which 
eats the angular velocity and spits out the angular momentum, i.e. 

L = Xco. (3.8) 

Equations (3.7) and (3.8) are most often seen in components (referred to a cartesian 
basis), where they read 

KE = [w] T [X][eo] 

[L] = [2] [co]. 

Note that since we raise and lower indices with an inner product and usually use 
orthonormal bases, the components of X when viewed as a (2, 0) tensor and when 
viewed as a (1, 1) tensor are the same, cf. Exercise 2.18. We’ll discuss X further in 
Sect. 3.6. 


Example 3.4 Multipole moments 


It is a standard result from electrostatics that the scalar potential <T> (r) of a charge 
distribution p(r') localized around the origin in R 3 can be expanded in a Taylor 
series as 1 


d>(r) = 


1 

4tt 



Q ] (r) | 1 62 (r. r) 

r 3 2! r 5 


1 e 3 (r,r,r) 
3! r 1 


(3.9) 


where the Q r are rth rank tensors known as the multipole moments of the charge 
distribution p(r'). The first few multipole moments are familiar to most physicists: 
the first, <2o, is just the total charge or monopole moment of the charge distribution 
and is given by 


1 Here and below we set all physical constants such as c and 60 equal to 1 . 
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Qo = f P( r ') d 3 r'. (3.10) 

The second, Q \ , is a dual vector known as the dipole moment (often denoted as p), 
which has components 



The third multipole moment, Qi, is known as the quadrupole moment and has 
components given by 

Qij = J (3 x' { x'j - r f2 8ij) d\'. (3.12) 

Notice that the Q;j are symmetric in i and j, and that JT Q,j = 0. Analogous 
properties hold for the higher order multipole moments as well (i.e. the octopole 
moment Qt, has components Qijk which are totally symmetric and which satisfy 
12 i Qiij — Qijj — JT Q j a = 0). We will explain these curious features of the 
Q r at the end of this chapter. 

Example 3.5 Metric tensors 

We met the Euclidean metric on R" in Example 2.17 and the Minkowski metric on 
R 4 in Example 2.19, and it is easy to verify that both are (2, 0) tensors (why isn’t 
the Hermitian scalar product of Example 2.21 included?). We also have the inverse 
metrics, defined in Problem 2.7, and you can verify that these are (0, 2) tensors. 

Exercise 3.3 Show that for a metric gonV, 

8i J =Si J , (3-13) 

so the (1,1) tensor associated to g (via g!) is just the identity operator. You will need the 
components g ,J of the inverse metric, defined in Problem 2.7 or Example 2.22. 


3.2 Change of Basis 

Now we are in a position to derive the usual transformation laws that historically 
were taken as the definition of a tensor. Suppose we have a vector space V and two 
bases for V, B = {e, ),=i „ and B' — {e,' }/'=i, Since B is a basis, each of the 

e,v can be expressed as 

er — Aj,ej (3.14) 

for some numbers A-,. Likewise, there exist numbers Aj (note that here the upper 
index is primed) such that 

e i =A 1 .ey. 


(3.15) 
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We then have 

ei = A{ ey = Aj' A),e k (3.16) 

and can then conclude that 

A{A k jr =Sf. (3.17) 

Considering (3.16) with the primed and unprimed indices switched also yields 

A i i ,A k -=^ l , (3.18) 

so, in a way, Aj and Aj, are inverses of each other. Notice that A j, and A j are not to 
be interpreted as the components of tensors, as their indices refer to different bases. 2 
How do the corresponding dual bases transform? Let {e'} i= i „ and {e‘ } i= i „ be 

the bases dual to B and B' . Then the components of e 1 ' with respect to je'}, = ] „ 

are 


e i \e j ) = e i \A k ' j e k ) = A k :&l,=A i ' j , 


(3.19) 


i.e. 


/ = A'jefi (3.20) 

Likewise, 

e i =A i j ,e i '. (3.21) 

Notice how well the Einstein summation convention and our convention for priming 
indices work together in the transformation laws. Now we are ready to see how the 
components of a general (r. s) tensor T transform: 


7 }/ h-is = T (e,7 , . . . , e,v , eh , . . . , ) 

= T(A k /e k A k ;e kr ,A k {e l i,...,Afe 1 ’) 

= A%...A k ?A k j...A£T(e kl ,...,e kr ,e\...,e '*) 


A k 1 A k r \h jJst ll—h 

A i[ ■■■ A i' r A h ■■■ A h lk l~ k r 


(3.22) 


Equation (3.22) is the standard tensor transformation law, which is taken as the 
definition of a tensor in much of the physics literature; here we have derived it as 
a consequence of our definition of a tensor as a multilinear function on V and V * . 
The two are equivalent, however, as you will check in Exercise 3.4 below. With the 
general transformation law in hand, we will now look at specific types of tensors and 


2 This is also why we wrote the upper index directly above the lower index, rather than with a 
horizontal displacement as is customary for tensors. For more about these numbers and a possible 
inteipretation, see the beginning of the next section. 
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e2 



Fig. 3.2 The standard basis B and a new one B' obtained by rotation through an angle 6 


derive their matrix transformation laws; to this end, it will be useful to introduce the 
matrices 



( A \ 

tl' 

A 2 

■ a;;\ 


( A v 

A \> ■ 

■ A x ,\ 

n' 


A — 

A? 

A 2 ’ 

a 2 

■ A2 n 

A-‘ = 

A % 

A 2 

/1 2 / 

A 2 

’ n' 



Uf 

A n ' 

/i 2 , 

■ At) 


\A", 

A”, . 

■ A’ 1 ,/ 

n' ' 



By virtue of (3.17) and (3.18), these matrices satisfy 


AA~ l =A~ 1 A = I (3.24) 


as our notation suggests. 

Exercise 3.4 Consider a function which assigns to a basis a set of numbers 

[Tk\ ...k, 1 ' ' ’ } which transform according to (3.22) under a change of basis. Use this assign- 
ment to define a multilinear function T of type (r, s) on V, and be sure to check that your 
definition is basis-independent (i.e. that the value of T does not depend on which basis 
{e;};=l,..., n you choose). 

Example 3.6 Change of basis matrix for a 2-D rotation 

As a simple illustration of the formalism, consider the standard basis B in R 2 and 
another basis B 1 obtained by rotating B by an angle 0. This is illustrated in Fig. 3.2. 
By inspection we have 


e\i — cos 9e\ + sin$e 2 
e 2 ’ = — sin0ei + cos 0 e 2 

Aj, =cos<9, A\, = — sinO 

A 2 V = sin0, A\,= cosO. 


and so by (3.14) we have 


(3.25) 

(3.26) 
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Equation (3.23) then tells us that 

i _ / cos 9 — sin# 

— Y sin 9 cos 6 

The numbers Aj and the corresponding matrix A can be computed by either invert- 
ing A -1 , or equivalently by inverting the system (3.25)— (3.26) and proceeding as 
above. □ 

Example 3.7 Vectors and dual vectors 

Given a vector v (considered as a (0, 1) tensor as per (3.3)), (3.22) tells us that its 
components transform as 

v i ' = A i ' j v j (3.27) 

while the components of a dual vector / transform as 

fi' = 4fi ■ (3-28) 

Notice that the components of v transform with the A', whereas the basis vectors 

j 

transform with the A jt , so the components of a vector obey the law opposite (‘con- 
tra’) to the basis vectors. This is the origin of the term contravariant’. Note also 
that the components of a dual vector transform in the same way as the basis vectors, 
hence the term ‘covariant’. It makes sense that the basis vectors and the compo- 
nents of a vector should transform oppositely; v exists independently of any basis 
for V and should not change under a change of basis, so if the e, change one way 
the v‘ should change oppositely. Similar remarks apply to dual vectors. 

Aside Incidentally, we can now explain a little bit more about the Einstein summation conven- 
tion. We knew ahead of time that the components of dual vectors would transform like basis vec- 
tors, so we gave them both lower indices. We also knew that the components of vectors would 
transform like dual basis vectors, so we gave them both upper indices. Since the two transfor- 
mation laws are opposite, we know (see below) that a summation over an upper index and lower 
index will yield an object that does not transform at all, so the summation represents an object on 
a process that is invariant, i.e. the expression v'e/ represents the vector i>, which is defined without 
reference to any basis. Also, the expression /; v l is just /(u), the action of the functional / on 
the vector v, also defined without reference to any basis. Processes such as these are so important 
and ubiquitous that it becomes very convenient to omit the summation sign for repeated upper 
and lower indices, and we thus have the summation convention. Occasionally one encounters two 
repeated upper indices or two repeated lower indices that are to be summed over; we choose to 
indicate summation explicitly in these cases, rather than conventionally omit the summation sign, 
because in these cases there is usually some assumption at work, e.g. the assumption of an or- 
thonormal basis (as in (3.32) below). In such cases the formulas are not completely general, and 
only represent invariant processes when the accompanying assumption is satisfied. 

Returning to our discussion of how components of vectors and dual vectors trans- 
form, we can write (3.27) and (3.28) in terms of matrices as 

Mb' = A\v]b 


(3.29) 

(3.30) 
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where the superscript T again denotes the transpose of a matrix. From the ‘aside’ 
above, we know that f(v) is basis-independent, but we also know that f(v) = 
[/]g[i;]g. This last equation then must be true in any basis, and we can in fact 
prove this using (3.29) and (3.30): in a new basis £>', we have 

= A[vi B 

= [f] T B A- l A[v] B 

= lf] B lv] B . (3.31) 

This makes concrete our claim above that [/] transforms ‘oppositely’ to [u], so that 
the basis-independent object f(v ) really is invariant under a change of basis. 

Before moving on to our next example we should point out a minor puzzle: you 
showed in Exercise 2.18 that if we have an inner product (■ | ■ ) on a real vector space 
V and an orthonormal basis {e, }, = i „ then the components of vectors and their 
corresponding dual vectors are identical, which is why we were able to ignore the 
distinction between them for so long. Equations (3.29) and (3.30) seem to contradict 
this, however, since it looks like the components of dual vectors transform very 
differently from the components of vectors. How do we explain this? Well, if we 
change from one orthonormal basis to another, we have 

n 

Sfj > = ieAey) = A*A l j,(e k \ei) = J2 A r A p 0-32) 

k= 1 

which in matrices reads 


t — 17 


A~ l = I 


so we must have 


t-H 


= A 


<=► A" 1 = A T . 

Such matrices are known as orthogonal matrices, and we see here that a transforma- 
tion from one orthonormal basis to another is always implemented by an orthogonal 
matrix. 3 For such matrices (3.29) and (3.30) are identical, resolving our contradic- 
tion. 

Incidentally, for a complex inner product space you will show that orthonormal 
basis changes are implemented by matrices satisfying A -1 = A 1 . Such matrices are 
known as unitary matrices and should be familiar from quantum mechanics. 

Exercise 3.5 Show that for any invertible matrix A, (A~ l ) T = (A r ) _1 , justifying the slop- 
piness of our notation above. 

Exercise 3.6 Show that for a complex inner product space V , the matrix A implementing 
an orthonormal change of basis satisfies A" 1 = A 3 . 


3 See Problem 3.1 for more on orthogonal matrices, as well as Chap. 4. 
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Example 3.8 Linear operators 

We already noted that linear operators can be viewed as (1, 1 ) tensors as per (3.4). 
(3.22) then tells us that, for a linear operator T on V, 

t/ = a*,a/V 

which in matrix form reads 


[T] b , = A[T] B A~ l (3.33) 

which is the familiar similarity transformation of matrices. This, incidentally, al- 
lows us to extend the trace functional from n x n matrices to linear operators as 
follows: Given T e Ci V) and a basis B for V, define the trace of T as 

Tr(r) = Tr([r]g). 

You can then use (3.33) to show (see Exercise 3.9) that Tr(T) does not depend on 
the choice of basis B. 

Exercise 3.7 Show that for v e V, f e V*, T e C(V), f(T v) = [f] 1 [r][n] is invariant 
under a change of basis. Use the matrix transformation laws as we did in (3.31). 

Exercise 3.8 Let B = { x , y, z}, B' = [x + iy, z, x — iy} be bases for H i(R 3 ), and consider 
the operator L z for which matrix expressions were found with respect to both bases in 
Example 2.15. Find the numbers At and Aj, and use these, along with (3.33), to obtain 
[L.] S / from [L Z \ B . 

Exercise 3.9 Show that (3.33) implies that Tr([r]g) does not depend on the choice of 
basis B, so that Tr(r) is well-defined. 

Example 3.9 (2, 0) tensors 

(2, 0) tensors g, which include important examples such as the Minkowski metric 
and the Euclidean metric, transform as follows according to (3.22): 

Si' j' = A k A l j,g k i 

or in matrix form 

[g]& = A~ lT [g] B A ~ l . (3.34) 

Notice that if g is an inner product and B and B' are orthonormal bases then [g] B i — 
[g] B — I and (3.34) becomes 

i = a~ iT a~\ 

again telling us that A must be orthogonal. Also note that if A is orthogonal, (3.34) 
is identical to (3.33), so we do not have to distinguish between (2, 0) tensors and 
linear operators (as most of us have not in the past!). In the case of the Minkowski 
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metric ;; we are not dealing with an inner product but we do have orthonormal bases, 
with respect to which 4 rj takes the form 


M = 


/i o o 
0 1 0 
0 0 1 
v 0 0 0 


0 \ 
0 

0 

-v 


(3.35) 


If we are changing from one orthonormal basis to another we then have 


/I 

0 

0 

0 ^ 


(\ 

0 

0 

0 ^ 


0 

1 

0 

0 

= a- iT 

0 

1 

0 

0 

A- 1 

0 

0 

1 

0 

0 

0 

1 

0 

\0 

0 

0 

-v 


\0 

0 

0 

-v 



or equivalently 


/I 

0 

0 

0 ^ 


(l 

0 

0 

0 

1 

0 

0 

= a t 

0 

1 

0 

0 

0 

1 

0 


0 

0 

1 

\0 

0 

0 

-v 


\0 

0 

0 


0 \ 
0 

0 

-V 


(3.36) 


(3.37) 


Matrices A satisfying (3.37) are known as Lorentz Transformations . Notice that 
these matrices are not quite orthogonal, so the components of vectors will transform 
slightly differently than those of dual vectors under these transformations. This is 
in contrast to the case of M" with a positive-definite metric, where if we go from 
one orthonormal basis to another, then the components of vectors and dual vectors 
transform identically, as you showed in Exercise 2.18. □ 


Exercise 3.10 As in previous exercises, show using the matrix transformation laws that 
g( v, w) = [u'] r [g][u] is invariant under a change of basis. 


3.3 Active and Passive Transformations 

Before we move on to the tensor product, we have a little unfinished business to 
conclude. In the last section when we said that the A- were not the components of 
a tensor, we were lying a little; there is a tensor lurking around, namely the linear 
operator U that takes the new basis vectors into the old, i.e. U (ej>) = e, V/ (the 
action of U on an arbitrary vector is then given by expanding that vector in the basis 
B' and using linearity). What are the components of this tensor? Well, in the old 
basis B we have 

Ui j = U{ ei ,e J ) - e j (Uei ) = e J (u(Afe k >)) - Afe J (U(e k f) = A? e j (e k ) 

= A{ (3.38) 


4 We assume here that the basis vector e t satisfying tj (e , , e ( ) = — 1 is the fourth vector in the basis, 
which is not necessary but is somewhat conventional in physics. 
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so the A j actually are the components of a tensor! 5 Why did we lie, then? Well, 
the approach we have been taking so far is to try and think about things in a basis- 
independent way, and although U is a well-defined linear operator, its definition 
depends entirely on the two bases we have chosen, so we may as well work directly 
with the numbers that relate the bases. Also, using one primed index and one un- 
primed index makes it easy to remember transformation laws like (3.20) and (3.21), 
but is not consistent with our notation for the components of tensors. 

If we write out the components of U as a matrix, you should verify that 

[e,]B = [U]s[ei']B = A[eAl3 (3.39) 

which should be compared to (3.29), which reads [v]b' — A[v]b- Equation (3.39) is 
called an active transformation, since we use the matrix A to change one vector into 
another, namely eg into e , . Note that in (3.39) all vectors are expressed in the same 
basis. Equation (3.29), on the other hand, is called a passive transformation, since 
we use the matrix A not to change the vector v but rather to change the basis which 
v is referred to, hence changing its components. The notation in most physics texts 
is not as explicit as ours; one usually sees matrix equations like 

r' = Ar (3.40) 

for both passive and active transformations, and one must rely on context to figure 
out how the equation is to be interpreted. In the active case, one considers the coor- 
dinate system fixed and interprets the matrix A as taking the physical vector r into 
a new vector r', where the components of both are expressed in the same coordinate 
system, just as in (3.39). In the passive case, the physical vector r does not change 
but the basis does, so one interprets the matrix A as taking the components of r 
in the old coordinate system and giving back the components of the same vector 
r in the new (primed) coordinate system, just as in (3.29). All this is illustrated in 
Fig. 3.3. 

Before we get to some examples, note that in the passive transformation (3.29) 
the matrix A takes the old components to the new components, whereas in the active 
transformation (3.39) A takes the new basis vectors to the old ones. Thus when A 
is interpreted actively it corresponds to the opposite transformation as in the pas- 
sive case. This dovetails with the fact that components and basis vectors transform 
oppositely, as discussed under (3.28). 


Example 3.10 Active and passive orthogonal transformations in two dimensions 


Let B — {e \ , ez] be the standard basis for R 2 , and consider a new basis B' given by 


1 


1 


ey = — —e\ H —ei 

sfl V2 


5 If the sleight-of-hand with the primed and unprimed indices in the last couple steps of (3.38) 
bothers you, puzzle it out and see if you can understand it. It may help to note that the prime on an 
index does not change its numerical value, it is just a reminder that it refers to the primed basis. 
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1 1 

g 2 / = —e\ H — — e 2 - 

s/2 s/2 


You can show (as in Example 3.6) that this leads to an orthogonal change of basis 
matrix given by 


A — 


s/2 

V2 

1 

1 

s/2 

V2 


(3.41) 


which corresponds to rotating our basis counterclockwise by <p = 45°, see 
Fig. 3.3(a). 

Now consider the vector r = + y^ e 2 , also depicted in the figure. In the 

standard basis we have 


Wb = 



What does r look like in our new basis? From the figure we see that r is proportional 
to e' v and that is indeed what we find; using (3.29) we have 


[r] b> = AMb = 


s/2 V2 ] j 2V2 ] 


s/2 s/2, 


, 2 ^2 / 



(3.42) 


as expected. Remember that the column vector at the end of (3.42) is expressed in 
the primed basis. 

This was the passive interpretation of (3.40); what about the active interpreta- 
tion? Taking our matrix A and interpreting it as a linear operator represented in the 
standard basis we again have 


e 2 e 2 




(a) (b) 

Fig. 3.3 Illustration of the passive and active interpretations of r' = Ar in two dimensions. In (a) 
we have a passive transformation, in which the same vector r is referred to two different bases. 
The coordinate representation of r transforms as r 1 = Ar, though the vector itself does not change. 
In (b) we have an active transformation, where there is only one basis and the vector r is itself 
transformed by r' = Ar. In the active case the transformation is opposite that of the passive case 
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[r'] B = A[r] B : 


(3.43) 


except that now the vector (1/2, 0) represents the new vector r' in the same basis B. 
This is illustrated in Fig. 3.3(b). As mentioned above, when A is interpreted actively, 
it corresponds to a clockwise rotation, opposite to its interpretation as a passive 
transformation. □ 


Exercise 3.11 Verify (3.41) and (3.42). 

Example 3.11 Active transformations and rigid body motion 

Passive transformations are probably the ones encountered most often in classical 
physics, since a change of cartesian coordinates induces a passive transformation. 
Active transformations do crop up, though, especially in the case of rigid body 
motion. In this scenario, one specifies the orientation of a rigid body by the time- 
dependent orthogonal basis transformation A(t) which relates the space frame K' to 
the body frame K ( t ) (we use here the notation of Example 2.11). As we saw above, 
there corresponds to the time-dependent matrix A(t) a time-dependent linear opera- 
tor U (t) which satisfies U (f)(e,v) = e,- (t). If K and K' were coincident at t — 0 and 
ro is the position vector of a point p of the rigid body at that time (see Fig. 3.4(a)), 
then the position of p at a later time is just r(f) = U (t )ro (see Fig. 3.4(b)), which as 
a matrix equation in K' would read 

[rtf)]*, = A(f)[r 0 ]*'. (3.44) 

In more common and less precise notation this would be written 

r(t) = A(t)r 0 . 

In other words, the position of a specific point on the rigid body at an arbitrary time 
t is given by the active transformation corresponding to the matrix A(t). □ 

Example 3.12 Active and passive transformations and the Schrodinger and Heisen- 
berg pictures 

The duality between passive and active transformations is also present in quantum 
mechanics. In the Schrodinger picture, one considers observables like the momen- 
tum or position operator as acting on the state ket while the basis kets remain fixed. 
This is the active viewpoint. In the Heisenberg picture, however, one considers the 
state ket to be fixed and considers the observables to be time-dependent (recall that 
(2.12) is the equation of motion for these operators). Since the operators are time- 
dependent, their eigenvectors (which form a basis 6 ) are time-dependent as well, so 
this picture is the passive one in which the vectors do not change but the basis does. 


6 For details on why the eigenvectors of Hermitian operators form a basis, at least in the finite- 
dimensional case, see Hoffman and Kunze [10]. 
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(a) Time t = 0 


(b) Time t 


Fig. 3.4 In (a) we have the coincident body and space frames at t = 0, along with the point p of 
the rigid body. In (b) we have the rotated rigid body, and the vector r (t) pointing to point p now 
has different components in the space frame, given by (3.44) 

Just as an equation like (3.40) can be interpreted in both the active and passive sense, 
a quantum mechanical equation like 


<x(t)> = (t/r| (t/^Jct/) |t/r) 

= (W\U*)x(U\f)), 


(3.45) 

(3.46) 


where U is the time-evolution operator for time t, can also be interpreted in two 
ways: in the active sense of (3.46), in which the Us act on the vectors and change 
them into new vectors, and in the passive sense of (3.45), where the U s act on the 
operator x by a similarity transformation to turn it into a new operator, x(t). 


3.4 The Tensor Product — Definition and Properties 

One of the most basic operations with tensors, again commonplace in physics but 
often unacknowledged (or, at best, dealt with in an ad hoc fashion) is that of the ten- 
sor product. Before giving the precise definition, which takes a little getting used to, 
we give a rough, heuristic description. Given two finite-dimensional vector spaces 
V and W (over the same set of scalars C), we would like to construct a product vec- 
tor space, which we denote V ® W, whose elements are in some sense ‘products’ 
of vectors v e V and w eW. We denote these products by v <%> w. This product, like 
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any respectable product, should be bilinear in the sense that 

(i>i + V 2 ) 0 w — v\ <g> w + V 2 ® w (3.47) 

v <g> (w 1 + W 2 ) = v <g> w\ + v 0 W 2 (3.48) 

c(v ® w) — (cv) <S> vo = v ® (no), ceC. (3.49) 

Given these properties, the product of any two arbitrary vectors v and w can then be 

expanded in terms of bases {e, },=i „ and {fj}j= 1 m for V and W as 

v ® w — ( v'e /) ® (w J fj) 

= v'w J ej 0 fj 

so {e,- ® fj }, i = 1, . . . , n, j = 1, . . . , m should be a basis for V <E> W, which would 
then have dimension nm . Thus the basis for the product space would be just the 
product of the basis vectors, and the dimension of the product space would be just 
the product of the dimensions. 

Now we make this precise. Given two finite-dimensional vector spaces V and W, 
we define their tensor product V <S) W to be the set of all C - valued bilinear functions 
on V* x W*. Such functions do form a vector space, as you can easily check. This 
definition may seem unexpected or counterintuitive at first, but you will soon see 
that this definition does yield the vector space described above. Also, given two 
vectors v e V, w e W, we define their tensor product v <E) w to be the element of 
V <S> W defined as follows: 


(i> ® w)(h, g) = v(/i)w(g) Wh e V* , g e W*. 


(3.50) 


(Remember that an element of V ® W is a bilinear function on V* x W*, and so 
is defined by its action on a pair (h, ,£» ) e V* x IT*.) The bilinearity of the tensor 
product is immediate and you can probably verify it without writing anything down: 
just check that both sides of (3.47)— (3.49) are equal when evaluated on any pair 
of dual vectors. To prove that {e,- ® fj), i = 1 ,...,«, j = l, ... ,m is a basis for 

V ® W, let {e'}i = 1 n , {f J ]i — 1 m be the corresponding dual bases and consider 

an arbitrary T e V (g> W. Using bilinearity, 


T(h, g) = h igj T(e i , fj) = higjT ij (3.51) 

where T ,J = T ( e l If we consider the expression T ,J e,- ® f j , then 

{rh^®fj){e k ,f l ) = r-’e l {e k )f ] (f l ) 

= T'j 8j k 8j l 

_ jkl 


so T' J ej ® f j agrees with T on basis vectors, hence on all vectors by bilinear- 
ity, so T — T' J ej ® fj. Since T was an arbitrary element of V ® W, V — 
Spanfe, ® fj}. Furthermore, the e,- ® fj are linearly independent as you should 
check, so {e, ® fj} is actually a basis for V ® W and V ® W thus has dimension mn. 
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The tensor product has a couple of important properties besides bilinearity. First, 
it commutes with taking duals , that is, 

(F ® W)* = V* <g> W*. (3.52) 

Secondly, and more importantly, the tensor product it is associative, i.e. for vector 
spaces Vi, / = 1,2,3, 


(Vl ® V 2 ) ® V 3 = Vi ® (V 2 ® V 3 ). (3.53) 

This property allows us to drop the parentheses and write expressions like 
Vi <g> ■ ■ • <g> V„ without ambiguity. One can think of Vi <g> • • ■ (g> V n as the set of 
C- valued multilinear functions on V* x • ■ ■ x V*. 

These two properties are both plausible, particularly when thought of in terms 
of basis vectors, but verifying them rigorously turns out to be slightly tedious. See 
Warner [18] for proofs and further details. 

Exercise 3.12 If {e;}, {/)} and {gjd are bases for Vi, V 2 and V3, respectively, convince 
yourself that { e ; ® fj (g> gjd is a basis for Vi ® V2 ® V3, and hence that dim Vi ® V2 ® V3 = 
H1H2H3 where dim V,- = n; . Extend the above to n-fold tensor products. 


3.5 Tensor Products of V and V* 

In the previous section we defined the tensor product for two arbitrary vector spaces 
V and W. Often, though, we will be interested in just the iterated tensor product of 
a vector space and its dual, i.e. in tensor products of the form 

y* <g> ••• <g> y*<g> v <g> • •• <g> v . (3.54) 

^ V v 

r times s times 

This space is of particular interest because it is actually identical to 7/ ! In fact, from 
the previous section we know that the vector space in (3.54) can be interpreted as 
the set of multilinear functions on 

y x • ■ ■ x y x y* x • • • x y *, 

V J ' v— ' 

r times s times 

but these functions are exactly T s r . Since the space in (3.54) has basis £>[ = 
[e !1 ® ® e lr ® ej t (8) • • • <g) ej s }, we can conclude that B‘ s is a basis for T[ . In 

fact, we claim that if T eTf has components 7/ ] ., r /l ■‘■ ,I , then 


T = Ti l ,„i r jl - js e‘ l <g> • ■ • <g> e' r ® e^ ® • ■ • <g> ej s 


(3.55) 


is the expansion of T in the basis B r s . To prove this, we just need to check that 
both sides agree when evaluated on an arbitrary set of basis vectors; on the left hand 
side we get T (e^ , . . . , e,- r , e Ji , . . . , e Js ) — 7} 1 ...., i y /I "' Js by definition, and on the right 
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hand side we have 



_ T . jl—ji 


(3.56) 


so our claim is true. Thus, for instance, a (2, 0) tensor like the Minkowski met- 
ric can be written as rj — rj^ v e^ <g> e v . Conversely, a tensor product like / <g> g — 
figje' ® e- 7 e thus has components (/ ® g)ij — figj. Notice that we now have 
two ways of thinking about components: either as the values of the tensor on sets 
of basis vectors (as in (3.5)) or as the expansion coefficients in the given basis (as 
in (3.55)). This duplicity of perspective was pointed out in the case of vectors just 
above Exercise 2. 10, and it is essential that you be comfortable thinking about com- 
ponents in either way. 

Exercise 3.13 Compute the dimension of TJ . 

Exercise 3.14 Let T i and 73 be tensors of type (ly , xi ) and (r 2 , S 2 ) , respectively, on a vector 
space V . Show that T\ (g> 73 can be viewed as an (r\ + r%, si + S2) tensor, so that the tensor 
product of two tensors is again a tensor, justifying the nomenclature. 

One important operation on tensors which we are now in a position to discuss is 
that of contraction, which is the generalization of the trace functional to tensors of 
arbitrary rank: Given T e T s r ( V ) with expansion 


.^n -js e n ® ® e lr ® e/j ® ® e ; - s 


(3.57) 


we can define a contraction of T to be any (r — 1,5 — 1) tensor resulting from 
feeding e' into one of the arguments, e, into another and then summing over i as 
implied by the summation convention. For instance, if we feed e, into the rth slot 
and e 1 into the ( r + j)th slot and sum, we get the (r — 1,5— 1 ) tensor 7 defined as 

T (in, . . . , V r -\, f\ , . . . , fs-l) = T(y \ , . . . , V r -u G , /l , • • • , fs-1, e‘) . 

You may be suspicious that T depends on our choice of basis, but Exercise 3.15 
shows that contraction is in fact well-defined. Notice that the components of T are 


jl—js-l — T 

— Ml 1/ 


Similar contractions can be performed on any two arguments of T provided one ar- 
gument eats vectors and the other dual vectors. In terms of components, a contrac- 
tion can be taken with respect to any pair of indices provided that one is covariant 
and the other contravariant. If we are working on a vector space equipped with a 
metric g, then we can use the metric to raise and lower indices and so can contract 
on any pair of indices, even if they are both covariant or contravariant. For instance, 
we can contract a (2, 0) tensor T with components T, j as T — T, 1 = g IJ Tjj, which 
one can interpret as just the trace of the associated linear operator (or (1 , 1) tensor). 
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For a linear operator or any other rank 2 tensor, this is the only option for contrac- 
tion. If we have two linear operators A and B, then their tensor product A ® B e T 2 2 
has components 

(A® B) ik J l = Ai> B k l , 

and contracting on the first and last index gives a (1, 1) tensor AB whose compo- 
nents are 

(AB) k j = V B k l . 

You should check that this tensor is just the composition of A and B, as our nota- 
tion suggests. What linear operator do we get if we consider the other contraction 
Ar'B/1 

Exercise 3.15 Show that if {e;};=i,..., n and {e, are two arbitrary bases that 

r(ui, . ... iv_i ,e/, /i, .. fs-i, e‘) = T(vi, .... iv-i, e,-/, /i /,-i, e 1 ') 

so that contraction is well-defined. 


Example 3.13 V* 0 V 

One of the most important examples of tensor products of the form (3.54) is V * ® V , 
which as we mentioned is the same as Tj 1 , the space of linear operators. How does 
this identification work, explicitly? Well, given / (g> v e V* ® V, we can define a 
linear operator by (/ ® v)(w) = f(w)v. More generally, given 

Ti-ie 1 ®ej eV* ®V, (3.58) 

we can define a linear operator T by 

T(v) = Ti j e\v)ej = t /Tr’ej 

which is identical to (2.13). This identification of V* ® V and linear operators is 
actually implicit in many quantum mechanical expressions. Let H be a quantum 
mechanical Hilbert space and let \//,(f> eH so that L(<p) e TL*. The tensor product 
of L(4>) and i//, which we would write as L(</>) <B) \!f , i s written in Dirac notation as 
\ifr)(<p\. If we are given an orthonormal basis B — { | f ) } , the expansion (3.58) of an 
arbitrary operator H can be written in Dirac notation as 

H = J2 H tj \i)U\, 

i.j 

an expression which may be familiar from advanced quantum mechanics texts. 7 In 
particular, the identity operator can be written as 

/ = £>■><!!, 
i 

which is referred to as the resolution of the identity with respect to the basis { | / ) } . 


7 We do not bother here with index positions since most quantum mechanics texts do not employ 
Einstein summation convention, preferring instead to explicitly indicate summation. 
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A word about nomenclature: In quantum mechanics and other contexts the tensor 
product is often referred to as the direct or outer product. This last term is meant 
to distinguish it from the inner product, since both the outer and inner products eat 
a dual vector and a vector (strictly speaking the inner product eats 2 vectors, but 
remember that with an inner product we may identify vectors and dual vectors) but 
the outer product yields a linear operator whereas the inner product yields a scalar. 

Exercise 3.16 Interpret e' ® ej as a linear operator, and convince yourself that its matrix 
representation is 



Recall that E j, is one of the elementary basis matrices introduced way back in Example 2.8, 
and has a 1 in the y'th row and i th column and zeros everywhere else. 


3.6 Applications of the Tensor Product in Classical Physics 

Example 3.14 Moment of inertia tensor revisited 

We took an abstract look at the moment of inertia tensor in Example 3.3; now, 
armed with the tensor product, we can examine the moment of inertia tensor more 
concretely. Consider a rigid body with a fixed point O, so that it has only rotational 
degrees of freedom ( O need not necessarily be the center of mass). Let O be the 
origin, pick time-dependent body-fixed axes K = jx(f), y(f), z (f)} for R 3 , and let g 
denote the Euclidean metric on M 3 . Recall that g allows us to define a map L from 
M 3 to R 3 *. Also, let the /th particle in the rigid body have mass m,- and position 
vector r, with = (x,-, yi , Zi) relative to O, and let rf = g( r,-, r,). All this is 
illustrated in Fig. 3.5. The (2, 0) moment of inertia tensor is then given by 



(3.59) 


while the (1,1) tensor reads 



(3.60) 


You should check that in components (3.59) reads 


3- jk — 'y ' m i ifi &jk ( r ; ) j ( r ; )k) ■ 


Writing a couple of components explicitly yields 


?xx = X >'(>’< 2 + z i ) 



(3.61) 
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Fig. 3.5 The rigid body with fixed point O and body-fixed axes K = {x, y, z}, along with / th 
particle at position r, with mass m, 


expressions which should be familiar from classical mechanics. So long as the basis 
is orthonormal, the components Ij k of the (1, 1) tensor in (3.60) will be the same 
as for the (2, 0) tensor, as remarked earlier. Note that if we had not used body-fixed 
axes, the components of r,- (and hence the components of I, by (3.61)) would in 
general be time-dependent; this is the main reason for using the body-fixed axes in 
computation. 

Example 3.15 Maxwell stress tensor 

In considering the conservation of total momentum (mechanical plus electromag- 
netic) in classical electrodynamics one encounters the symmetric rank 2 Maxwell 
Stress Tensor , defined in (2, 0) form as 8 

1 

7(2,0) = E®E + B®B — - (E • E + B • B)g 

where E and B are the dual vector versions of the electric and magnetic field vec- 
tors. T can be interpreted in the following way: T ( v , w) gives the rate at which 
momentum in the u-direction flows in the w-direction. In components we have 

Tij = Et Ej + B, B j — ~ (E ■ E + B • B )S U , 
which is the expression found in most classical electrodynamics textbooks. 


'Recall that we have set all physical constants such as c and eg equal to 1. 
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Example 3.16 The electromagnetic field tensor 


As you has probably seen in discussions of relativistic electrodynamics, the electric 
and magnetic field vectors are properly viewed as components of a rank 2 antisym- 
metric tensor F, the electromagnetic field tensor. 9 To write F in component-free 
notation requires machinery outside the scope of this text, 10 so we settle for its ex- 
pression as a matrix in an orthonormal basis, which in (2, 0) form is 


[E a ,o>] = 


The Lorentz force law 


/ 0 

Ex 

Ey 

E z \ 

-E x 

0 

~B Z 

By 

~Ey 

B z 

0 

~B X 

\~ E z 

-By 

B x 

0 / 

dp' 1 

— riF^ 

v v 


dr 

— H r v 

U 



(3.62) 


where p — mv is the 4-momentum of a particle, v is its 4-velocity and q its charge, 
can be rewritten without components as 


— = qF(i, i)(u) (3.63) 

dr 

which just says that the Minkowski force ^ on a particle is given by the action of 
the field tensor on the particle’s 4-velocity! 


3.7 Applications of the Tensor Product in Quantum Physics 

In this section we will discuss further applications of the tensor product in quantum 
mechanics, in particular the oft-unwritten rule that to add degrees of freedom one 
should take the tensor product of the corresponding Hilbert spaces. Before we get 
to this, however, we must set up a little more machinery and address an issue that 
we have so far swept under the rug. The issue is that when dealing with spatial 
degrees of freedom, as opposed to ‘internal’ degrees of freedom like spin, we often 
encounter Hilbert spaces like i 2 ([— a, a]) and L 2 (R) which are most conveniently 
described by ‘basis’ vectors which are eigenvectors of either the position operator x 
or the momentum operator p. The trouble with these bases is that they are often non- 
denumerably infinite (i.e. cannot be indexed by the integers, unlike all the bases we 
have worked with so far) and, what is worse, the ‘basis vectors’ do not even belong 


9 In this example and the one above we are actually not dealing with tensors but with tensor fields, 
i.e. tensor-valued functions on space and spacetime. For the discussion here, however, we will 
ignore the spatial dependence of the fields, focusing instead on the tensorial properties. 

10 One needs the exterior derivative, a generalization of the curl, divergence and gradient operators 
from vector calculus. See Schutz [15] for a very readable account. 
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to the Hilbert space! Consider, for example, L 2 (R). The position operator x acts on 
functions t/r(x) e L 2 (R) by 

xt/r(x) = xt/r(x). (3.64) 

If we follow the practice of most quantum mechanics texts and treat the Dirac delta 
functional <5 as L(S(x)) where 5(x), the ‘Dirac delta function’, is infinite at 0 and 0 
elsewhere, you can check (see Exercise 3.17) that 

xS(x — xq) = xqS (x — xq) 

so that <S(x — xo) is an ‘eigenfunction’ of x with eigenvalue xo (in Dirac notation we 
write the corresponding ket as |x<j)). The trouble is that, as we saw in Example 2.23, 
there is no such <5(x) e L 2 (R)\ Furthermore, since the basis {<5(x — xo)} xo <=ir is in- 
dexed by K and not some subset of Z, we must expand i jr e L 2 (R) by integrating 
instead of summing. Integration, however, is a limiting procedure and one should 
really worry about what it means for an integral to converge. Rectifying all this in 
a rigorous manner is possible, 11 but outside the scope of this text, unfortunately. 
We do wish to work with these objects, however, so we will content ourselves with 
the traditional approach: ignore the fact that the delta functions are not elements of 
L 2 (R), work without discomfort with the basis {<5(x — xo)}j; oe R, 12 and fearlessly 
expand arbitrary functions i jr in the basis {<5(x — xo)}.r 0 eR as 

/ OO 

dx' V f (x , )5(x — x'), (3.65) 

-CO 

where the above equation can be interpreted both as the expansion of ijr and just the 
definition of the delta function. In Dirac notation (3.65) reads 

/ CO 

dx' \jr(x'} \x'). (3.66) 

-CO 

Note that we can think of the numbers i jr(x) as the components of \ijr) with respect to 
the basis {|x)} xs r. Alternatively, if we define the inner product of our basis vectors 
to be 


(x|x') = <5(x — x') 

as is usually done, then using (3.66) we have 

x[r( x ) = <x|i/r> (3.67) 

which gives another interpretation of i//(x). These two interpretations of i//(x) are 
just the ones mentioned below (3.56); that is, the components of a vector can be 
interpreted either as expansion coefficients, as in (3.66), or as the value of a given 
dual vector on the vector, as in (3.67). 


n This requires the so-called 'rigged' Hilbert space; see Ballentine [2]. 

12 Working with the momentum eigenfunctions e ,px instead does not help; though these are legiti- 
mate functions, they still are not square-integrable since \e ,px | 2 dx = oo! 
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Exercise 3.17 By considering the integral 



S (x — xo ) ) / ( x ) dx 


(where / is an arbitrary square-integrable function), show formally that 


x.5(x — xo) = XqB{x — X'o). 

Exercise 3.18 Check that {5(x — xo)}. ro eM satisfies (2.28). 

Exercise 3.19 Verify (3.67). 

While we are at it, let us pose the following question: we mentioned in a foot- 
note on the previous page that one could use momentum eigenfunctions instead of 
position eigenfunctions as a basis for L 2 (R); what does the corresponding change 
of basis look like? 


Example 3.17 The momentum representation 

As is well-known from quantum mechanics, the eigenfunctions of the momentum 
operator p = —i ^ are the wavefunctions {e' pv }p € R, and these wavefunctions form 
a basis for L 2 ( R). In fact, the expansion of an arbitrary function \jr e L 2 (R) in this 
basis is just the Fourier expansion of t/r, written 

1 r°° 

i'(x) = — dp <p(p)e' px (3.68) 

J _ oo 

where the component function (j> ( p ) is known as the Fourier Transform of t//. One 
could in fact work exclusively with (p(p) instead of i// (x ) , and recast the operators x 
and p in terms of their action on <j>( p) (see Exercise 3.20 below); such an approach 
is known as the momentum representation. Now, what does it look like when we 
switch from the position representation to the momentum representation, i.e. when 
we change bases from {i5(x — xo)}x 0 eK t0 l e ‘ px }peS . ? Since the basis vectors are 
indexed by real numbers p and xo as opposed to integers i and j, our change of basis 
will not be given by a matrix with components A 1 , but rather a function A(x o, p). 
By (3.19) and the fact that both bases are orthonormal, this function is given by 
the inner product of <5(x — xo) and e lpx . In Dirac notation this would be written as 
(xolp), and we have 

/ OO 

dx S(x — xo)e lpx = e' px °, 

-OO 

a familiar equation. □ 

Exercise 3.20 Use (3.68) to show that in the momentum representation, p<p(p ) = pcp(p ) 
and xcp(p) = i^. 

The next issue to address is that of linear operators: having constructed a new 
Hilbert space 13 Hi <S> H 2 out of two Hilbert spaces Hi and H 2 , can we construct 


13 You may have noticed that we defined tensor products only for finite-dimensional spaces. The 
definition can be extended to cover infinite-dimensional Hilbert spaces, but the extra technicalities 


3.7 Applications of the Tensor Product in Quantum Physics 


63 


linear operators onH\® TtS out of the linear operators on H\ and TtS? Well, given 
linear operators A,- on Hi, i — 1, 2, we can define a linear operator Ai ® A 2 on 
H\ <g> H2 by 


(Ai <g> A 2 )( v 0w) = (A in) <g> (A 2 w). (3.69) 

You can check that with this definition, (A ® B)(C ® D) = AC ® fiD. In most 
quantum mechanical applications either A 1 or A 2 is the identity, i.e. one considers 
operators of the form Aj ® / or I ® A 2 . These are often abbreviated as Ai and A 2 
even though they are acting on H \ ® TlS. We should also mention here that the inner 
product (• | •)$ on H\ ® H2 is just the product of the inner products on (• | •),- on 
the Hi , that is, 


(ui ® V2\wi <g> W 2 )® = (ui, uq)i • (l> 2 , W 2 ) 2 . 

The last subject we should touch upon is that of vector operators, which are 
defined to be sets of operators that transform as three-dimensional vectors under the 
adjoint action of the total angular momentum operators J , . That is, a vector operator 
is a set of operators {/?/ },=i 3 (often written collectively as B) that satisfies 

3 

ad j, ( Bj ) = [Ji , Bj] = i J2 tijkBk (3.70) 

k= 1 

where €jjk is the familiar Levi-Civita symbol. The three-dimensional position op- 
erator ? = [x,y,z], momentum operator p = \p x , p y , /3- } , and orbital angular mo- 
mentum operator L = {L x , L y , L z } are all vector operators, as you can check. 

Exercise 3.21 For spinless particles, J = L = x x p. Expressions for the components may 
be obtained by expanding the cross product or referencing Example 2.15 and Exercise 2.9. 
Use these expressions and the canonical commutation relations [je, , pj] = —i&ij to show 
that x, p and L are all vector operators. 

Now we are finally ready to consider some examples, in which we will take as an 
axiom that adding degrees of freedom is implemented by taking tensor products 
of the corresponding Hilbert spaces. You will see that this process reproduces 
familiar results. 

Example 3.18 Addition of translational degrees of freedom 

Consider a spinless particle constrained to move in one dimension; the quantum 
mechanical Hilbert space for this system is L 2 (R) with basis {|x)} as r. If we con- 
sider a second dimension, call it the y dimension, then this degree of freedom has 
its own Hilbert space L 2 (M.) with basis {|y)}ysR- If we allow the particle both de- 
grees of freedom then the Hilbert space for the system is L 2 Q R) ® L 2 (K), with basis 
{|x} ® |y)}j;,ysR. An arbitrary ket \ f) e L 2 (R) <g> L 2 (]R) has expansion 


needed do not add any insight to what we are trying to do here, so we omit them. The theory of 
infinite-dimensional Hilbert spaces falls under the rubric of functional analysis, and details can be 
found, for example, in Reed and Simon [12]. 
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\ty) 


-n 

J — oo J — oo 


r/x rfv t/r (x, y)|x) ® |y) 


with expansion coefficients i/r(x, y). If we iterate this logic, we get in 3 dimensions 

*00 /»00 no O 


\f) = 


/ OO r OO p 

-oo J —oo J — oo 


dx dy dzir(x, y, z)\x) ® |y) <g> |z). 


If we rewrite i/r(x, y, z) as x/r (r) and |x) <8> |y) <8> |z) as |r> where r = (x, y, z), then 
we have 


|i /r) = y r/ 3 r x// (r) |r> 

which is the familiar expansion of a ket in terms of three-dimensional position eigen- 
kets. Such a ket is an element of L 2 (R) <g> L 2 (R) <g> L 2 (K), which is also denoted as 
L 2 (R 3 ). 14 


Example 3.19 Two-particle systems 

Now consider two spinless particles in three-dimensional space, possibly interacting 
through some sort of potential. The two-body problem with a 1/r potential is a clas- 
sic example of this. The Hilbert space for such a system is then L 2 (R 3 ) ® L 2 (R 3 ), 
with basis { |rj ) ® I r 2 ) } r , eK 3 • man Y textbooks the tensor product symbol is omit- 
ted and such basis vectors are written as |ri)|r 2 > or even |r i , r 2 ). A ket |i/r> in this 
Hilbert space then has expansion 

\f) = J d 3 ri J d i r2 VK r i, r 2 )|r, , r 2 ) 

which is the familiar expansion of a ket in a two-particle Hilbert space. One can 
interpret , rz) as the probability amplitude of finding particle 1 in position ri 
and particle 2 in position V 2 simultaneously. 

Example 3.20 Addition of orbital and spin angular momentum 

Now consider a spin 5 particle in three dimensions. As remarked in Example 2.2, 
the ket space corresponding to the spin degree of freedom is C 2s+1 , and one usually 
takes a basis {|m)}_ s < m < i of S- eigenvectors with eigenvalue m. The total Hilbert 
space for this system is L 2 (]R 3 ) ® C 2s+1 , and we can take as a basis { |r) ® | m)} 
where reR 3 and —s<m<s. Again, the basis vectors are often written as |r) | m) 
or even |r, m). An arbitrary ket |i [r) then has expansion 


14 L 2 (R 3 ) is actually defined to be the set of all square-integrable functions on R 3 , i.e. functions / 
satisfying 



dx dy dz |/| 2 < oo. 


Not too surprisingly, this space turns out to be identical to L 2 (R) ® L 2 (R) ® L 2 (R). 


3.7 Applications of the Tensor Product in Quantum Physics 


65 


|t/Q=^ / d 3 r\Js m (r)\r,m) 

m=—s ^ 

where i j/ m (r) is the probability of hnding the particle at position r and with m units 
of spin angular momentum in the z-direction. These wavefunctions are sometimes 
written in column vector form 

/ ir s \ 

i/L- 1 


'I'-s+l 

V *-s 

The total angular momentum operator J is given byL®/ + /®S where L is 
the orbital angular momentum operator. One might wonder why J is not given by 
L ® S; there is a good answer to this question, but it requires delving into the (fas- 
cinating) subject of Lie groups and Lie algebras, which we postpone until part II. In 
the meantime, you can get a partial answer by checking (Exercise 3.22 below) that 
the operators L, ® .S', do not satisfy the angular momentum commutation relations 
whereas the L, ® / + I ® .S, do. 

Exercise 3.22 Check that 

3 

[Li 0 / + / 0 Si , L j 0 / + / (8) Sj] = ^ijk(L k 0 / + / 0 S k ). 


[Lj 0 Si , L j 0 Sj] ^ ^ ' €ij k L k 0 S k . 

k= 1 

Be sure to use the bilinearity of the tensor product carefully. 

Example 3.21 Addition of spin angular momentum 

Next consider two particles of spin ,V| and s 2 , respectively, fixed in space so that 
they have no translational degrees of freedom. The Hilbert space for this system is 
C 2il + 1 ® C 2i2+1 , with basis {| m\) <g> \m 2 )} where — st < m,- < Si, i = 1, 2. Again, 
such tensor product kets are usually abbreviated as \m\)\m 2 ) or \m\,mf). There are 
several important linear operators on C 2il+1 <g> C 2j2+1 : 

• Si <g> I Vector spin operator on first particle 

• I <S> S 2 Vector spin operator on second particle 

• S = S| 0 / + / 0 S 2 Total vector spin operator 

• S 2 = ffj Si S, Total spin squared operator 

(Why aren’t S 2 and S 3 in our list above?) The vectors \tn 1 , mi) are clearly eigen- 
vectors of .Sj - and S 2z and hence S z (we abuse notation as mentioned below (3.70)) 
but, as you will show in Exercise 3.23, they are not necessarily eigenvectors of S 2 . 
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However, since the .S', obey the angular momentum commutation relations (as you 
can check), the general theory of angular momentum tells us that we can find a basis 
for C 2il+1 ® C 2jz+1 consisting of eigenvectors of S z and S 2 . Furthermore, it can be 
shown that the S 2 eigenvalues that occur are s(s + 1 ) where 


s = |si — S2I, |st — S2I + 1 , • ■ • , H +S2 ( 3 . 71 ) 

and for a given 5 the possible S z eigenvalues are m where —s <m<s as usual (see 
Sakurai [14] for details). We will write these basis kets as {|s, m)} where the above 
restrictions on s and m are understood, and where we physically interpret {|s, m)} as 
a state with total angular momentum equal to *Js (s + 1 ) and with m units of angular 
momentum pointing along the z-axis. Thus we have two natural and useful bases for 
C 2si+1 ®C 2s2+1 : 

B = { |m 1 , m2)}, — ST<mi<si, —S2<m2<si 

B'={\s,m)}, kt — $2] < J < si + J 2 1 —s<m<s. 

What does the transformation between these two bases look like? Well, by their 
definition, the A 1 ., relating the two bases are given by e- / (e,v); using s, m collectively 
in lieu of the primed index and m 1 , m 2 collectively in lieu of the unprimed index, 
we have, in Dirac notation, 


= (3.72) 

These numbers, the notation for which varies widely throughout the literature, are 
known as Clebsch-Gordan Coefficients . Methods for computing them can be found 
in any standard quantum mechanics textbook. 

Let us illustrate the foregoing with an example. Take two spin 1 particles, so that 
.V] = .s ’2 = 1 . The Hilbert space for the first particle is C 3 , with Si, eigenvector basis 
{| — 1), |0), 1 1)}, and so the two-particle system has nine-dimensional Hilbert space 
C 3 ® C 3 with corresponding basis 

B={\i)\j)\iJ = -1,0,1} 

= { 1 1> 1 1> , 1 1> |0> , 1 1) I — 1) , |0> 1 1> , etc.} . 

There should also be another basis consisting of S z and S 2 eigenvectors, however. 
From (3.71) we know that the possible s values are ,v = 0, 1, 2, and it is a standard 
exercise in angular momentum theory to show that the nine (normalized) S z and S 2 
eigenvectors are 

|s = 2, = 2) = 1 1> 1 1) 

|2,1> = -J=(|1>|0) + |0)|1>) 

|2,0> = -J=(|1>|-1) + 2|0>|0> + |-1)|1)) 

| 2,-i > =-J=(|- 1 )|0 > + | 0 > |- 1 )) 
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| 2 , — 2 } = | — 1 )| — 1 ) 

|1,1> = -J=(|1>|0>-|0)|1» 

1 1 . 0) = — (l 1)1 — 1> — I — 1> 1 1>) 

| 1 ,- 1 > = 4 =(| 0 >|- 1 >-|- 1 >| 0 » 

| 0 , 0 > = -^(| 0 )| 0 >- | 1 >|- 1 > -|- 1 )| 1 >). 

These vectors can be found using standard techniques from the theory of ‘addition 
of angular momentum’; for details see Gasiorowicz [5] or Sakurai [14]. The coef- 
ficients appearing on the right hand side of the above equations are precisely the 
Clebsch-Gordan coefficients (3.72), as a moment’s thought should show. 

Exercise 3.23 Show that 

S 2 = S? ® / + /®S? + 2^S l; ® S 2i . 

i 

The right hand side of the above equation is usually abbreviated as S 2 + S? + 2Si ■ S 2 . Use 
this to show that | m 1 , mn) is not generally an eigenvector of S 2 . 

Example 3.22 Entanglement 

Consider two Hilbert spaces 77 \ and 77 2 and their tensor product 77 1 ® 'Hi. Only 
some of the vectors in 77 1 ® 772 can be written as 1 /r <g> </>; such vectors are referred 
to as separable states or product states. All other vectors must be written as linear 
combinations of the form 1 //; ® </>,■ , and these vectors are said to be entangled, 
since in this case the measurement of the degrees of freedom represented by 77 1 
will influence the measurement of the degrees of freedom represented by 77 t- The 
classic example of an entangled state comes from the previous example of two fixed 
particles with spin; taking ,V| — sj — 1/2 and writing the standard basis for C 2 as 
{|+), |— )}, we consider the particular state 

l+)|-)-|-)l+). (3-73) 

If an observer measures the first particle to be spin up then a measurement of the 
second particle’s spin is guaranteed to be spin-down, and vice versa, so measuring 
one part of the system affects what one will measure for the other part. This is the 
sense in which the system is entangled. For a product state x/r <g> </>, there is no such 
entanglement: a particular measurement of the first particle cannot affect what one 
measures for the second, since the second particle’s state will be c/i no matter what. 
You will check below that (3.73) is not a product state. 

Exercise 3.24 Prove that (3.73) cannot be written as i/s (g> tp for any \[r. <j> s C 2 . Do this by 
expanding xp and <f> in the given basis and showing that no choice of expansion coefficients 
for t/r and ip will yield (3.73). 


68 


3 Tensors 


3.8 Symmetric Tensors 

Given a vector space V there are certain subspaces of T^(V) and T^(V) which 
are of particular interest: the symmetric and antisymmetric tensors. We will discuss 
symmetric tensors in this section and antisymmetric tensors in the next. A symmet- 
ric (r, 0) tensor is an (r. 0) tensor whose value is unaffected by the interchange (or 
transposition ) of any two of its arguments, that is, 

for any i and j. Symmetric (0, r) tensors are defined similarly. You can easily check 
that the symmetric (r, 0) and (0, r) tensors each form vector spaces, denoted S r (V*) 
and S' (V), respectively. For T e S' (V*), the symmetry condition implies that the 
components 7] 1 .../ r are invariant under the transposition of any two indices, hence 
invariant under any rearrangement of the indices (since any rearrangement can be 
obtained via successive transpositions). Similar remarks apply, of course, to S' (V). 
Notice that for rank 2 tensors, the symmetry condition implies Tjj = 7’ /( so that 
[T\b for any B is a symmetric matrix. Also note that it does not mean anything to 
say that a linear operator is symmetric, since a linear operator is a (1, 1) tensor and 
there is no way of transposing the arguments. One might find that the matrix of a 
linear operator is symmetric in a certain basis, but this will not necessarily be true 
in other bases. If we have a metric to raise and lower indices then we can, of course, 
speak of symmetry by turning our linear operator into a (2, 0) or (0, 2) tensor. 

Example 3.23 S 2 (R 2 *) 

Consider the set {e l <8> e 1 , e 2 ® e 2 , e 1 ® e 2 + e 2 ® e 1 } C S 2 (R 2 *) where {e 1 };=i ,2 is 
the standard dual basis. You can check that this set is linearly independent, and that 
any symmetric tensor can be written as 

T = T\\e l ® e l + T^e 2 <g> e 2 + Tn(e l ® e 2 + e 2 <g> e 1 ) (3.74) 

so this set is a basis for S 2 (K 2 *), which is thus three-dimensional. In particular, the 
Euclidean metric g on R 2 can be written as 

g = e l ® e l + e 2 0 e 2 

since gn = g 22 = 1 and gu = gn = 0. Note that g would not take this simple form 
in a non-orthonormal basis. □ 

Exercise 3.25 Let V = R" with the standard basis B. Convince yourself that 

[e; ® ej + ej®ei] B = Sij 
where is the symmetric matrix defined in Example 2.8. 

There are many symmetric tensors in physics, almost all of them of rank 2. Many 
of them we have met already: the Euclidean metric on ]R 3 , the Minkowski metric 
on M 4 . the moment of inertia tensor, and the Maxwell stress tensor. You should refer 
to the examples and check that these are all symmetric tensors. We have also met 
one class of higher rank symmetric tensors: the multipole moments. 



3.8 Symmetric Tensors 


69 


Example 3.24 Multipole moments and harmonic polynomials 


Recall from Example 3.4 that the scalar potential <t>(r) of a charge distribution p ( r' ) 
localized around the origin in K 3 can be expanded in a Taylor series in 1 / r as 


0>(r) = 


1 

4t r 


Go <2i(r) 

■ 1 


1 62 (r,r) 

2! r 5 


1 G 3 (r,r,r) 

3! r 1 


where the Qi are the symmetric rank l multipole moment tensors. Each symmetric 
tensor Q\ can be interpreted as a degree / polynomial //, just by evaluating on l 
copies of r = (x 1 , x 2 , x 3 ) as indicated: 


Mr) = G;( r, ■ . • ,r) = Q h ... ilX h ■ --M, (3.75) 

where the indices ij are the usual component indices, not exponents. Note that the 
expression x' 1 • • • x H in the right hand side is invariant under any rearrangement of 
the indices ij. This is because we fed in l copies of the same vector r into Q[. This 
fits in nicely with the symmetry of Q/ t . In fact, the above equation gives a one-to- 
one correspondence between I th rank symmetric tensors and degree / polynomials; 
we will not prove this correspondence here, but it should not be too hard to see 
that (3.75) turns any symmetric tensor into a polynomial, and that, conversely, any 
fixed degree polynomial can be written in the form of the right hand side of (3.75) 
with Qi v ..i, symmetric. This (roughly) explains why the multipole moments are 
symmetric tensors: the multipole moments are really just fixed degree polynomials, 
which in turn correspond to symmetric tensors. 

What about the tracelessness of the Qi , i.e. the fact that Qi r - k - k -i„ — 0? 
Well, <t>(r) obeys the Laplace equation A$(r) = 0, which means that every term in 
the expansion must also obey the Laplace equation. Each term is of the form 

Mr) 

r 2l+l ’ 

and if we write the polynomial //( r) as r 1 Y (0 . <p) then a quick computation shows 
that Y (0, <p) must be a spherical harmonic of degree / and hence // must be a har- 
monic polynomial! Expanding //( r) in the form (3.75) and applying the Laplacian 
then shows that if fi is harmonic, then Qi must be traceless. □ 

Exercise 3.26 What is the polynomial associated to the Euclidean metric tensor g = 

Xj = i e> ® e 'l What is the symmetric tensor in S 3 (R 3 ) associated to the polynomial x 2 y ? 

Exercise 3.27 Substitute /)( r) = r 1 Y (6, (f>) into the equation 



and show that Y(9,<p) must be a spherical harmonic of degree l. Then use (3.75) to show that 
if // is a harmonic polynomial, then the associated symmetric tensor Q\ must be traceless. 
If you have trouble showing that Q\ is traceless for arbitrary /, try starting with the 1 = 2 
(quadrupole) and / = 3 (octopole) cases. 
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3.9 Antisymmetric Tensors 

Now we turn to antisymmetric tensors. A n antisymmetric (or alternating) (r, 0) ten- 
sor is one whose value changes sign under transposition of any two of its arguments, 

i.e. 


T(vi,...,vi,...,vj,...,v r ) = -T(vi,...,vj,...,vi,..., v r ). (3.77) 

Again, antisymmetric (0, r) tensors are defined similarly and both sets form vector 
spaces, denoted A'V* and A' V (for r — 1 we define A 1 V* — V* and A 1 V — V). 
The following properties of antisymmetric tensors follow directly from (3.77); the 
first one is immediate, and the second two you will prove in Exercise 3.28 below: 


1 . T(v \ , . . . , ry ) = 0 if Vi = vj for any i ^ j 

2. T(v \, . . . , v r ) = 0 if {ui, . . . , v r } is linearly dependent 

3. If dim V — n, then the only tensor in A' V* and A' V for r > n is the 
0 tensor. 


An important operation on antisymmetric tensors is the wedge product : Given 
f,geV* we define the wedge product of / and g, denoted / A g, to be the anti- 
symmetric (2, 0) tensor defined by 


f Ag = f ®g- g® f. 


(3.78) 


Note that f a g = —g A /, and that / A / = 0. Expanding (3.78) in terms of the e l 
gives 


/ A g — figj (e ! ® e j - e j ® e') — figje 1 A e j (3.79) 

so that {e l A e J }i < ; spans all wedge products of dual vectors (note the “i < j ” 
stipulation, since e l A e 1 and e J A e‘ are not linearly independent). In fact, you can 
check that {e 1 A e' j };< ; is linearly independent and spans A 2 V*, hence is a basis 
for A 2 V*. The wedge product can be extended to /--fold products of dual vectors as 
follows: given r dual vectors f\ , . . . , f r , we define their wedge product f\ A • ■ • A f r 
to be the sum of all tensor products of the form /,■, ® ■ ■ ■ ® f tr where each term gets 
a + or a — sign depending on whether an odd or an even number 15 of transpositions 
of the factors are necessary to obtain it from /i ® ■ ■ ■ ® f r ; if the number is odd the 
term is assigned — 1 , if even a + 1 . Thus, 


fi A f 2 = f\ ® f 2 ~ fi ® f\ (3.80) 

/l A f 2 A f 3 = /j ® f 2 ® / 3 + fl ® h ® f\ + /3 ® fl ® fl 

- h® h® f\- h® .fi ® h - h ® h ® fi (3.81) 

and so on. You should convince yourself that [e n A • ■ ■ A e lr )i ] <...< !r is a basis for 

A' V* . Note that this entire construction can be carried out for vectors as well as 


15 The number of transpositions required to get a given rearrangement is not unique, of course, 
but hopefully you can convince your self that it is always odd or always even. A rearrangement 
which always decomposes into an odd number of transpositions is an odd rearrangement , and 
even rearrangements are defined similarly. We will discuss this further in Chap. 4. specifically in 
Example 4.22. 
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dual vectors. Also note that all the comments about symmetry above Example 3.23 
apply here as well. 

Exercise 3.28 Let T eAT. Show that if {t>i, . . . , v r ] is a linearly dependent set then 
T(vi, . . . , v r ) = 0. Use the same logic to show that if {/i , . . . , f r } C V* is linearly depen- 
dent, then /i A • • • A f r = 0. If dim V = n, show that any set of more than n vectors must 
be linearly dependent, so that A r V = A' V* = 0 for r > n . 

Exercise 3.29 Expand the (2, 0) electromagnetic field tensor of (3.62) in the basis {e‘ Ae J ] 
where i < j and i, j = 1 , 2, 3, 4. 

Exercise 3.30 Let dim V = n . Show that the dimension of A r V* and A r V is (") = . 

Example 3.25 Identical particles 


In quantum mechanics we often consider systems which contain identical particles, 
i.e. particles of the same mass, charge and spin. For instance, we might consider n 
non-interacting hydrogen atoms moving in a potential well, or the two electrons of 
the helium atom orbiting around the nucleus. In such cases we would assume that 
the total Hilbert space Htot would be just the «-fold tensor product of the single 
particle Hilbert space H. It turns out, however, that nature does not work that way; 
for certain particles (known as bosons) only states in S n (H) are observed, while for 
other particles (known as fermions) only states in A"H are observed. All known 
particles are either fermions or bosons. This restriction of the total Hilbert space 
to either S n (H) or A n H is known as the symmetrization postulate and has far- 
reaching consequences. For instance, if we have two fermions, we cannot measure 
the same values for a complete set of quantum numbers for both particles, since 
then the state would have to include a term of the form \f)\f) and thus could not 
belong to A 2 H . This fact that two fermions cannot be in the same state is known as 
the Pauli Exclusion Principle. As another example, consider two identical spin 1 /2 
fermions fixed in space, so that Htot — A 2 C 2 . A 2 C 2 is one-dimensional with basis 
vector 


\ 1 \ 

1 \ 

/ 2 / 

2 / 


where we have used the notation of Example 3.21. If we measure S 2 or S- for this 
system we will get 0. This is in marked contrast to the case of two distinguishable 
spin 1/2 fermions; in this case the Hilbert space is C 2 ® C 2 and we have additional 
possible state kets 


| 1 , 1 > = 
| 1 , 0 > = 
| 1 .- 1 > = 


which yield nonzero values for S 2 and S z . 


□ 
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The next two examples are a little more mathematical than physical but they are 
necessary for the discussion of pseudovectors below. Hopefully you will also find 
them of interest in their own right. 

Example 3.26 The Levi-Civita tensor 


Consider R" with the standard inner product. Let {e, }/ = i „ be an orthonormal 

basis for R" and consider the tensor 


e = e 1 A • • • A e n 


e A”! 


You can easily check that 


Cl ...i n 


0 if {ii, . . . , in) contains a repeated index 
— 1 if {ii, . . . , i,,} is an odd rearrangement of {1, ... , n } 
+ 1 if {ii, . . . , i n ] is an even rearrangement of {1, . . . , n). 


For n = 3, you can also check (Exercise 3.31) that etjk has the same values as the 
Levi-Civita symbol, and so e here is an n -dimensional generalization of the three- 
dimensional Levi-Civita tensor we introduced in Example 3.2. As in that example, 
€ should be thought of as eating n vectors and spitting out the n -dimensional volume 
spanned by those vectors. This can be seen explicitly for n = 2 also; considering two 
vectors u and v in the (x, y ) plane, we have 


e(u, v ) = €iju‘v J 


= u x v y - « y v x 


u - V 


= (u X v) z 

and we know that this last expression can be interpreted as the (signed) area of the 
parallelogram spanned by u and v. 

Finally, note that A”R"* is one-dimensional, and that e is the basis for it de- 
scribed under (3.81). 

You may object that our construction of e seems to depend on a choice of met- 
ric and orthonormal basis. The former is true: e does depend on the metric, and 
we make no apologies for that. As to whether it depends on a particular choice of 
orthonormal basis, we must do a little bit of investigating; this will require a brief 
detour into the subject of determinants. 


Exercise 3.31 Check that the e tensor on R 3 satisfies 

1 +1 if {i jk } is a cyclic permutation l6 of {1, 2, 3} 

— 1 if {r jk} is an anti-cyclic permutation of { 1 , 2, 3} 

0 otherwise. 

Is it true for e on R 4 that e,/*/ = 1 if [i jkl } is a cyclic permutation of {1, 2, 3, 4}? 


16 A cyclic permutation of { 1 , is any rearrangement of {1, .... «} obtained by succes- 
sively moving numbers from the beginning of the sequence to the end. That is, {2, 1), 

{3, ... ,n, 1,2}, and so on are the cyclic permutations of {1, ... ,n}. Anti-cyclic permutations are 
cyclic permutations of [n, n — 1, . . . , 1}. 


3.9 Antisymmetric Tensors 


73 


Example 3.27 The determinant 

You have doubtless encountered determinants before, and have probably seen them 
defined iteratively; that is, the determinant of a 2 x 2 square matrix A, denoted |A| 
(or det A), is defined to be 


I A | = A11A22 — A12A21 


(3.82) 


and then the determinant of a 3 x 3 matrix B is defined in terms of this, i.e. 


\B\ = B n 


B22 

B32 


B23 

B33 


— B 12 


B21 

B 31 


B23 

B33 


+ B\3 


B21 

B 31 


B22 

B32 


(3.83) 


This expression is known as the cofactor expansion of the determinant, and is not 
unique; one can expand about any row (or column), not necessarily (5 ji, B 12 , B 13 ). 


In our treatment of the determinant we will take a somewhat more sophisticated 
approach . 17 Take an n x n matrix A and consider its n columns as n column vectors 
in M", so that the first column vector A\ has zth component An and so on. Then, 
constructing the e tensor using the standard basis and inner product on R" , we define 
the determinant of A, denoted | A | or det A, to be 


I A | = e{A\, ... , A n ) 

or in components 


(3.84) 


l^|— £i\,...,i n Ain ■ ■ ■ Ai n n- 

II.-, Ik 


(3.85) 


You should check explicitly that this definition reproduces (3.82) and (3.83) for 
n = 2,3. You can also check in the Problems that many of the familiar properties 
of determinants (sign change under interchange of columns, invariance under addi- 
tion of rows, factoring of scalars) follow quite naturally from the definition and the 
multilinearity and antisymmetry of e. 

Since the determinant is defined in terms of the epsilon tensor, which has an 
interpretation in terms of volume, this suggests that the determinant has an inter- 
pretation in terms of volume as well. Consider our matrix A as a linear operator 
on R" ; then A sends the standard orthonormal basis {e\ ..... e n ] to a new, poten- 
tially non-orthonormal basis {Ae 1 , . . . , Ae„j. See Fig. 3.6. Then just as {ei, . . . , e „ ) 
spans a regular/! -cube whose volume is 1, the vectors {Ae 1 , . . . , Ae n } spanaskewed 
//-cube whose volume is given by 


e{Ae 1 , . . . , Ae„) = e( Aqieq, . . . , A inn e in 

V i\ i n 

= ■ ■ ■ Ai n „€i l 


II In 

= det A! 


17 For a complete treatment, however, you should consult Hoffman and Kunze [10], Chap. 5. 
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e-3 



Ae 3 



Fig. 3.6 The action of A on the standard basis in R 3 


Thus, the determinant of a matrix is the (oriented) volume of the skew «-cube 
obtained by applying A to the standard «-cube! You may have noticed that this 
volume can be negative, which is why we called the determinant an oriented (or 
signed ) volume; the interpretation of this is given in the next example. 

Example 3.28 Orientations and the e tensor 

Note This material is a bit abstract and may be skipped on a first reading. 

With the determinant in hand we may now explore to what extent the definition of 
e depends on our choice of orthonormal basis. Consider another orthonormal basis 
{er — A J .,ej}. If we define an e' in terms of this basis, we find 

e' = e 1 ' A • • • a/ 

= A)' . . . A'-'V 1 A • • • A e ln 

1 1 l n 

= Aj'...Afe h - in e 1 A---Ae n 

‘1 l n 

= \A\e (3.86) 

where in the third equality we used the fact that if e' 1 A • ■ • A e‘ n does not vanish 
it can always be rearranged to give e l A • • • A e n , and the resulting sign change if 
any is accounted for by the Levi-Civita symbol. Now since both {e, } and {e,/} are 
orthonormal bases, A must be an orthonormal matrix, so we can use the product 
rule for determinants \AB\ = |A||fi| (see Problem 3.4 for a simple proof) and the 
fact that \A T \ = |A| to get 

1 = |/| = | A A r | = | A 1 1 A 7 | = | A | 2 (3.87) 

which implies | A | = ± 1 . Thus by (3.86) e' = f if the two orthonormal bases used in 
their construction are related by an orthogonal transformation A with | A | = 1 ; such 
a transformation is called a rotation , 1 8 and two bases related by a rotation, or by any 


18 No doubt you are used to thinking about a rotation as a transformation that preserves distances 
and fixes a line in space (the axis of rotation). This definition of a rotation is particular to R 3 , since 
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Right-handed bases 


Left-handed bases 


Fig. 3.7 The two orientations of R 3 . The upper-left most basis is usually considered the standard 
basis 


transformation with | A| >0, are said to have the same orientation. If two bases are 
related by a basis transformation with |A| < 0 then the two bases are said to have 
the opposite orientation. We can then define an orientation as a maximal 19 set of 
bases all having the same orientation, and you can show (see Problem 3.6) that R" 
has exactly two orientations. In R 3 these two orientations are the right-handed bases 
and the left-handed bases, and are depicted schematically in Fig. 3.7. Thus we can 
say that e does not depend on a particular choice of orthonormal basis, but it does 
depend on a metric and a choice of orientation, where the orientation chosen is the 
one determined by the standard basis. 

The notion of orientation allows us to understand the interpretation of the de- 
terminant as an ‘oriented’ volume: the sign of the determinant just tells us whether 
or not the orientation of {Ae,} is the same as \e, }. Also, for orientation-changing 
transformations on R 3 one can show that A can be written as A = Ao(— /), where 
Ao is a rotation and —/is referred to as the inversion transformation. The inversion 
transformation plays a key role in the next example. 

Example 3.29 Pseudovectors in R 3 

Note All indices in this example refer to orthonormal bases. The calculations and 
results below do not apply to non-orthonormal bases. 


even in R 2 a rotation cannot be considered to be “about an axis” since z ^ R 2 . For the equivalence 
of our general definition and the more intuitive definition in R 3 , see Goldstein [6], 

19 I.e. could not be made bigger. 
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A pseudovector (or axial vector) is a tensor on R 3 whose components trans- 
form like vectors under rotations but do not change sign under inversion. Common 
examples of pseudovectors are the angular velocity vector &>, the magnetic field vec- 
tor B, as well as all cross products, such as the angular momentum vector L = rxp. 
It turns out that pseudovectors like these are actually elements of A 2 R 3 , which are 
known as bivectors. 

To see the connection, consider the wedge product of two vectors r, p e M 3 . This 
looks like 

r A p = (r + r 2 e 2 + re 3) A (p 1 ei + p 2 e 2 + p 3 e 3) 

= (r 1 p 2 — r 2 p l )ei Ae 2 + (r 3 p l — r'p 3 )e 3 A e\ + (r 2 p 3 — r 3 p 2 )e 2 A e 3. 
This looks just like r x p if we make the identifications 


e\ A e 2 — > £3 

ej, A e\ — > e 2 (3.88) 

e 2 A e-i — > e\ . 


In terms of matrices, this corresponds to the identification 



(3.89) 


This identification can be embodied in a one-to-one and onto map J from A 2 R 3 
to R 3 . If a. e A 2 R 3 then we can expand it as 

23 31 12 

a — a e 2 A e^ + a ej, A e\ + a e\ A e 2 , 

and we then define J in component as 


J : A 2 R 3 -a R 3 

1 (3.90) 

a ,J hA (/(a))' = -e‘jk(x jk . 

You will check below that this definition really does give the identifications writ- 
ten above. Note that J is essentially just a contraction with the epsilon tensor. With 
this, we see that r x p is really just J (r A p) ! Thus, cross products are essentially 
just bivectors. 


Exercise 3.32 Check that J , as defined by (3.90), acts on basis vectors as in (3.88). Also 
check that when written in terms of matrices, J produces the map (3.89). 


Now that we know how to identify bivectors and regular vectors, we must ex- 
amine what it means for bivectors to transform Tike’ vectors under rotations, but 
without a sign change under inversion. On the face of things, it seems like bivectors 
should transform very differently from vectors; after all, a bivector is a (0, 2) tensor, 
and you can show that it has matrix transformation law 

[«]#' = A[a]^A T . 


(3.91) 
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This looks very different from the transformation law for the associated vector J (a), 
which is just (cf. (3.29)) 


[j{a)] B , = A[j{a)] B . (3.92) 

In particular, the bivector a transforms with two copies of A, and the vector J (a) 
with just one. How could these transformation laws be ‘the same’? Well, remember 
that a is not any old (0, 2) tensor, but an antisymmetric one, and for these a small 
miracle happens. This is best appreciated by considering an example. Let A be a 
rotation about the z-axis by an angle #, so that 


( cos# —sin# 0\ 
sin# cos# 0 I . 

0 0 1 / 


Then you can check that, with a 1 = (J (a))' , 


Mb' = A[oi]bA t 


/ cos# 

— sin# 

o\ 

( ° 

— a 3 

« 2 

\ 

sin# 

cos# 

° 

a 3 

0 

—a [ 


V o 

0 

1 / 

V-a 2 

a 1 

0 

/ 

/ 

0 



— O' 3 




a 3 



0 




J 

— > 


y — or cos # — a 1 sin # 

/ —or sin 0 + a 1 cos 9 \ 


a 2 cos 9 + a 1 sin# 

\ « 3 


-a 2 sin# + a 1 cos# 


cos# sin# 0\ 

— sin# cos# 0 I 

0 0 1 / 

a 2 cos# + a 1 sin#'' 
or sin# — a 1 cos# 

0 

(3.93) 

(3.94) 


which is exactly a rotation about the "-axis of /(a)! This seems to suggest that if 
we transform the components of a e A 2 R 3 by a rotation first and then apply 7, or 
apply J and then rotate the components, we get the same thing. In other words, the 

map J commutes with rotations, and that is what it means for both bivectors 
and vectors to behave the same’ under rotations. 


Exercise 3.33 Derive (3.91). You may need to consult Sect. 3.2. 

Exercise 3.34 Verify (3.93) by performing the necessary matrix multiplication. 


To prove that / commutes with arbitrary rotations (the example above just 
proved it for rotations about the z-axis), we need to show that 


A l j a 


J = - j' A k ' A 1 ' n/ m " 
2 6 k l A m A n a 


(3.95) 


where A is a rotation. On the left hand side J is applied first followed by a rotation, 
and on the right hand side the rotation is done first, followed by J . 
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We now compute 


1 

2 


' \k! mn 

k'l'A m A a 


\e p , vv V’v' A k y n a mn 

^J2 e P'k’l' A qAq A^Ol 1 

q 

l -Y.^n\A\A^a m >' 


q 

= l -\A\e\„,A i y" 

= \A\A‘'a“ (3.96) 

where in the second equality we used a variant of (3.32) which comes from writ- 
ing out A A 1 = I in components, in the third equality we used the easily verified 


fact that (p'k'V Aq A^A l n = \A\e qmn , and in the fourth equality we raised an index 
to resume use of Einstein summation convention and were able to do so because 
covariant and contravariant components are equal in orthonormal bases. Now, for 
rotations |A| = 1 so in this case (3.96) and (3.95) are identical and the compo- 
nents of J(a ) transform like the components of a vector. For inversion, however, 

| A | = | — / 1 = — 1 so (3.96) tells us that the components of J (a) do not change sign 
under inversion, as those of an ordinary vector would. Another way to see this is 
given in the exercise below. 


Exercise 3.35 Use (3.91) to show that the components of J(a) do not change sign under 
inversion. 


We have thus shown that pseudovectors are bi vectors, since bivectors transform 
like vectors under rotation but do not change sign under inversion. We have also 
seen that cross products are very naturally interpreted as bivectors. There are other 
pseudovectors lying around, though, that do not naturally arise as cross products. 
For instance, what about the angular velocity vector col 


Example 3.30 The angular velocity vector 


The angular velocity vector co is usually introduced in the context of rigid body 
rotations. One usually fixes the center of mass of the body, and then the velocity v 
of a point of the rigid body is given by 

v=®xr (3.97) 

where r is the position vector of the point as measured from the center of mass. The 
derivation of this equation usually involves consideration of an angle and axis of 
rotation, and from these considerations one can argue that co is a pseudovector. Here 
we will take a different approach in which co will appear first as an antisymmetric 
matrix, making the bivector nature of co manifest. 

Let K and K' be two orthonormal bases for R 3 as in Example 2.11, with K 
time-dependent. One should think of K as being attached to the rotating rigid body. 
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Fig. 3.8 Our rigid body with fixed space frame K' and body frame K in gray 

whereas K' is fixed. We will refer to K as the body frame and K' as the space frame. 
Both frames have their origin at the center of mass of the rigid body. This is depicted 
in Fig. 3.8. 

Now let r represent a point of the rigid body; then [rk will be its coordinates 
in the body frame, and [r]^/ its coordinates in the space frame. Let A be the (time- 
dependent) matrix of the basis transformation taking K' to K , so that 

[r] K ' = A[r] K . (3.98) 

We would now like to calculate the velocity [vk' of our point in the rigid body, as 
seen in the space frame, and compare it to (3.97). This is given by just differentiat- 
ing [!•]*•' : 

d 

[vk' = — [r k' 

dt 

= ^tA [ rk by (3.98) 
dt 
dA 

= — [rk since [rk is constant 
dt 

dA | 

= — A-‘[rk'. (3.99) 

dt 

So far this does not really look like (3.97). What to do? First, observe that ^ A” 1 
is actually an antisymmetric matrix: 
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= *±A' 


dA 1 

__ . A 

dt dt 

dA aT / dA 
= — A r + ——A 


dt 


\ dt 


(3.100) 


We can then define an angular velocity bivector cb whose components in the space 
frame are given by 


dA 


[®k' = ~r A 

dt 


-l 


(3.101) 

Then we simply define the angular velocity vector cd to be 

co=J(cb). (3.102) 

Note that co is, in general, time-dependent. It follows from this definition that 

/ 0 -co 3 ' co 2 ' \ 

dA , 

— A~ l = 
dt 


co 3 0 — co 1 

V-® 2 ' co 1 ' 0 


(we use primed indices since we are working with the K' components), so then 


dA 

dt 


i —1 r 



( ° 

3 

rn 

3 

1 

= 

® 3 ' 

0 -c 


U® 2 ' 

co 1 ' 


/ 2' 3' 

( CD 

— CD 3 ' X 2 ' ) 

= 

3' V 
CO X 

1' 3' 

— CD X 


1 V 2' 
\® X 

2' 1' , 
- CD X ) 

= [® x rjtf 

\ 


//\ 


WJ 


Combining this with (3.99), we then have 


[vk- = [»xr] r (3.103) 

which is just (3.97) written in the space frame! Thus the ‘pseudovector’ co can 
be viewed as nothing more than the vector associated to the antisymmetric matrix 
^ A~ 1 , and multiplication by this is just the cross product with col □ 


Exercise 3.36 Use (3.91) to show that the bivector £> in the body frame is 


[w\k = A 1 


dA 

dt 


Combine this with (3.99) to show that (3.97) is true in the body frame as well. 


In the last example we saw that to any time-dependent rotation matrix A we 
could associate an antisymmetric matrix ^M -1 , which we can identify with the 
angular velocity vector which represents ‘infinitesimal’ rotations. This association 
between finite transformations and their infinitesimal versions, which in the case of 
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rotations takes us from orthogonal matrices to antisymmetric matrices, is precisely 
the relationship between a Lie group and its Lie algebra. We turn our attention to 
these objects in the next part of this book. 


3.10 Problems 

3.1 In this problem we explore the properties of n x n orthogonal matrices. This is 
the set of real invertible matrices A satisfying A 1 — A -1 , and is denoted 0{n). 

(a) Is 0(n) a subspace of M„(K)? 

(b) Show that the product of two orthogonal matrices is again orthogonal, that the 
inverse of an orthogonal matrix is again orthogonal, and that the identity ma- 
trix is orthogonal. These properties show that 0(n) is a group , i.e. a set with 
an associative multiplication operation and identity element such that the set 
is closed under multiplication and every element has a multiplicative inverse. 
Groups are the subject of Chap. 4. 

(c) Show that the columns of an orthogonal matrix A, viewed as vectors in 1R", are 
mutually orthogonal under the usual inner product. Show the same for the rows. 
Show that for an active transformation, i.e. 

[ei']s — A[ei])3 

where B = {e,} i= i „ so that 

Mb = ( 0 0), 

/thslot 

the columns of A are the [tyjg. In other words, the components of the new basis 
vectors in the old basis are just the columns of A. This also shows that for the 
corresponding passive transformation, in which 

[ ei ] B , = A _ 1 [e,]g 

the columns of A - 1 are the components of the old basis vectors in the new basis. 

(d) Show that the orthogonal matrices A with |A| = 1, the rotations, form a sub- 
group unto themselves, denoted SO{n). Do the matrices with |A| = —1 also 
form a subgroup? 

3.2 In this problem we will compute the dimension of the space of (0, r) symmetric 
tensors S' (V). This is slightly more difficult to compute than the dimension of the 
space of (0, r) antisymmetric tensors A r V, which was Exercise 3.30. 

(a) Let dim V = n and {e; };= be a basis for V . Argue that dim S' ( V ) is given 
by the number of ways you can choose r (possibly repeated) vectors from the 
basis {e/}i=i, 
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(b) We have now reduced the problem to a combinatorics problem: how many ways 
can you choose r objects from a set of n objects, where any object can be chosen 
more than once? The answer is 



(3.104) 


Try to derive this on your own. If you need help, the solution to this is known as 
the “stars and bars” or “balls and walls” method; you can also refer to Sternberg 
[16], Chap. 5. 

3.3 Prove the following basic properties of the determinant directly from the defi- 
nition (3.84). We will restrict our discussion to operations with columns, though it 
can be shown that all the corresponding statements for rows are true as well. 

(a) Any matrix with a column of zeros has | A \ = 0. 

(b) Multiplying a column by a scalar c multiplies the whole determinant by c. 

(c) The determinant changes sign under interchange of any two columns. 

(d) Adding two columns together, i.e. sending A, — >• A,- + A j for any i and j , does 
not change the value of the determinant. 

3.4 One can extend the definition of determinants from matrices to more general 
linear operators as follows: We know that a linear operator T on a vector space V 

(equipped with an inner product and orthonormal basis {e, }, = i „ ) can be extended 

to an operator on the /j-fold tensor product T® ( V ) by 


T(v\ <g> • • - <g> v p ) = (Tvi) <g> • - - <g> (Tv p ). 


Since A" V C T®(V), the action of T then extends to A" V similarly by 
T(v i A • • • A v„) = (Tvi) A • • • A (T v n ). 

Consider then the action of T on the contravariant version of e, the tensor e = 
e 1 A • ■ • A e„ . We know from Exercise 3.30 that A" V is one-dimensional, so that 
T(e) = (Te\) A • • • A (T e n ) is proportional to e, and we define the determinant of 
T to be this proportionality constant, so that 


(Te\) A • • ■ A (T e n ) = \T\e\ A--Ae„. 


(3.105) 


(a) Show by expanding the left hand side of (3.105) in components that this more 
general definition reduces to the old one of (3.85) in the case of V = M". 

(b) Use this definition of the determinant to show that for two linear operators B 
and C on V, 


\BC\ — |B||C|. 


In particular, this result holds when B and C are square matrices. 

(c) Use (b) to show that the determinant of a matrix is invariant under similarity 
transformations (see Example 3.8). Conclude that we could have defined the 
determinant of a linear operator T as the determinant of its matrix in any basis. 
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3.5 Let V be a vector space with an inner product and orthonormal basis 

{ e,-},-= i „. Prove that a linear operator T is invertible if and only if 7’ ^ 0, as 

follows: 

(a) Show that T is invertible if and only if {T(ej)}i=\ is a linearly independent 
set (see Exercise 2.7 for the ‘if’ part of the statement). 

(b) Show that |7j ^ 0 if and only if {7’(e,)} ;= i n is a linearly independent set. 

(c) This is not a problem, just a comment. In Example 3.27 we interpreted the 
determinant of a matrix A as the oriented volume of the n-cube determined by 
{ Ae, } . As you just showed, if A is not invertible then the Ae, are linearly depen- 
dent, hence span a space of dimension less than n and thus yield n -dimensional 
volume 0. Thus, the geometrical picture is consistent with the results you just 
obtained! 

3.6 Let B be the standard basis for R", O the set of all bases related to B by a 
basis transformation with |A| >0, and O' the set of all bases related to B by a 
transformation with | A| <0. 

(a) Using what we have learned in the preceding problems, show that a basis trans- 
formation matrix A cannot have | A| =0. 

(b) O is by definition an orientation. Show that O' is also an orientation, and con- 
clude that R" has exactly two orientations. Note that both O and O' contain 
orthonormal and non-orthonormal bases. 

(c) For what n is A = — / an orientation-changing transformation? 
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In physics we are often interested in how a particular object behaves under a par- 
ticular set of transformations. For instance, one often reads that under rotations a 
dipole moment transforms like a vector and a quadrupole moment like a second 
rank tensor, and that electric and magnetic fields transform independently like vec- 
tors under rotations but collectively like a second rank antisymmetric tensor under 
Lorentz transformations. Similarly, in quantum mechanics one is often interested 
in the “spin” of a ket (which specifies how it transforms under rotations), or its 
behavior under the time-reversal or space inversion (parity) transformations. This 
knowledge is particularly useful as it leads to the many famous “selection rules” 
which greatly simplify evaluation of matrix elements. Transformations are also cru- 
cial in quantum mechanics because, as we will see, all physical observables can be 
considered as “infinitesimal generators” of particular transformations; for example, 
the angular momentum operators “generate” rotations (as we discussed briefly in 
Problem 2.3) and the momentum operator “generates” translations. 

Like tensors, this material is often treated in a somewhat ad hoc way which fa- 
cilitates computation but obscures the underlying mathematical structures. These 
underlying structures are known to mathematicians as group theory, Lie theory and 
representation theory, and are known collectively to physicists as just “group the- 
ory”. Our aim in this second part of the book is to present the basic facts of this 
theory as well as its manifold applications to physics, both to clarify and unify the 
diverse phenomena in physics which it underlies and also to provide a nice applica- 
tion of what we have learned about tensors. 

Before we discuss how particular objects transform, however, we must discuss 
the transformations themselves, in both their ‘finite’ and ‘infinitesimal’ form. That 
discussion is the subject of the present chapter. We will begin with a discussion of 
the ‘finite’ transformations, all of which share a few common properties: First, the 
performance of two successive transformations is always equivalent to the perfor- 
mance of a single, third transformation (just think of rotations and how any two 
successive rotations about two axes can be considered as a single rotation about a 
third axis). Second, every transformation has an inverse which undoes it (in the case 
of rotations, the inverse to any given rotation is a rotation about the same axis in the 
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opposite direction). Sets of transformations like these occur so often in mathematics 
and physics that they are given a name: groups. 


4.1 Groups — Definition and Examples 

The definition of a group that we are about to give may appear somewhat abstract, 
but is just meant to embody the most important properties of sets of transformations. 
After giving the definition and establishing some basic properties, we will proceed 
shortly to some concrete examples. 

A group is a set G together with a ‘multiplication’ operation, denoted •, that 
satisfies the following axioms: 

1. (Closure) g, h e G implies g ■ h e G. 

2. (Associativity) For g , h , k e G , g ■ (h ■ k) — (g ■ h) ■ k . 

3. (Existence of the identity) There exists an element e e G such that g-e = e- g — g 
WgeG. 

4. (Existence of inverses) Vg e G there exists an element h e G such that g ■ h — 
h ■ g — e. 

If we think of a group as a set of transformations, as we usually do in physics, 
then the multiplication operation is obviously just composition; for instance, if R 
and S are three-dimensional rotations, then R ■ S is just S followed by R. Note that 
we do not necessarily have R ■ S — S ■ R for all rotations R and S ; in cases such as 
this, G is said to be non-commutative (or non-abelian). If we did have S ■ R = R ■ S 
for all R. S e G, then we would say that G is commutative (or abelian ). 

There are several important properties of groups that follow almost immediately 
from the definition. Firstly, the identity is unique, for if e and / are both elements 
satisfying axiom 3 then we have 

e — e ■ f since / is an identity 
= / since e is an identity. 

Secondly, inverses are unique : Let g e G and let h and k both be inverses of g. 
Then 


g ■ h — e (4.1) 

so multiplying both sides on the left by k gives 
k ■ (g ■ h) = k, 

(k ■ g) ■ h = k by associativity, 

e ■ h — k since k is an inverse of g, 
h = k. 

We henceforth denote the unique inverse of an element g as g -1 . 
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Thirdly, if g e G and h is merely a right inverse for g, i.e. 

g-h = e, (4.2) 

then li is also a left inverse for g and is hence the unique inverse g~ 1 . This is seen 
as follows: 

h '8 = (g _1 ' g) ■ ( h ' g) 

= (g _ 1 ■ (g • h )) • g by associativity 
= {g~ l -e)-g by (4.2) 

= g~ { 'g 
= e, 

so h = g~ l . 

The last few properties concern inverses and can be verified immediately: 

U“T 1 = s 

(, g-hy 1 =h~ l ■ g~ l 
e~ l = e. 

Exercise 4.1 Prove the cancelation laws for groups, i.e. that 

g\ ■ h = g2 ■ h => g i = g2 
hgi=hg 2 =4> g i = g2- 

Before we get to some examples, we should note that properties 2 and 3 in the 
definition above are usually obviously satisfied and one rarely needs to check them 
explicitly. The important thing in showing that a set is a group is verifying that 
it is closed under multiplication and contains all its inverses. Also, as a matter of 
notation, from now on we will usually omit the • when writing a product, and simply 
write gh for g ■ h. 

Example 4.1 R: The real numbers as an additive group 

Consider the real numbers M with the group ‘multiplication’ operation given by 
regular addition, i.e. 


x-y = x + y, tjeR. (4.3) 

It may seem counterintuitive to define ‘multiplication’ as addition, but the definition 
of a group is rather abstract so there is nothing that prevents us from doing this, and 
this point of view will turn out to be useful. With addition as the product, M becomes 
an abelian group: The first axiom to verify is closure, and this is satisfied since 
the sum of two real numbers is always a real number. The associativity axiom is 
also satisfied, since it is a fundamental property of real numbers that addition is 
associative. The third axiom, dictating the existence of the identity, is satisfied since 
OeR fits the bill. The fourth axiom, which dictates the existence of inverses, is 
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satisfied since for any reR, — x is its (additive) inverse. Thus R is a group under 
addition, and is in fact an abelian group since x + y = y + x, Vx, y e R. 

Note that R is not a group under regular multiplication, since 0 has no multi- 
plicative inverse. If we remove 0, though, then we do get a group, the multiplicative 
group of nonzero real numbers, denoted R*. We leave it to you to verify that R* 
is a group. You should also verify that this entire discussion goes through for C as 
well, so that C under addition and (C* = C\{0} under multiplication are both abelian 
groups. 

Example 4.2 Vector spaces as additive groups 

The previous example of R and C as additive groups can be generalized to the case 
of vector spaces, which are abelian groups under vector space addition. The group 
axioms follow directly from the vector space axioms, as you should check, with 0 
as the identity. While viewing vector spaces as additive groups means we ignore 
the crucial feature of scalar multiplication, we will see that this perspective will 
occasionally prove useful. 

Example 4.3 GL(V), GL(n, R), and GL(n, C): The general linear groups 

The general linear group of a vector space V, denoted GL(V), is defined to be 
the subset of CiV) consisting of all invertible linear operators on V . We can easily 
verify that GL(V ) is a group: to verify closure, note that for any T,U e GL(V), TU 
is linear and ( TU)~ l — U~ l T _1 , so TU is invertible. To verify associativity, note 
that for any T,U,V e GL{V ) and teV.we have 

(T(UV))( v) = T(U{V(V))) = {(TU)V)(v) (4.4) 

(careful unraveling the meaning of the parentheses!) so that T(UV) — (TU)V . To 
verify the existence of the identity, just note that I is invertible and linear, hence in 
GL(V). To verify the existence of inverses, note that for any T e GL(V ), T~ [ exists 
and is invertible and linear, hence is in GL(V) also. Thus GL(V) is a group. 

Let V have scalar field C and dimension n. If we pick a basis for V , then for 
each T e GL(V ) we get an invertible matrix [ 7’] e M n (C). Just as all the invertible 
T e Ci V ) form a group, so do the corresponding invertible matrices in M„(C); 
this group is denoted as GL(n, C ), and the group axioms can be readily verified 
for it. 1 When C = R we get GL(n, R), the real general linear group in n dimen- 
sions, and when C — C we get GL(n, C), the complex general linear group in n 
dimensions. □ 

While neither GL(V), GL(n. M), nor GL(n , C) occur explicitly very often in 
physics, they have many important subgroups, i.e. subsets which themselves are 
groups. The most important of these arise when we have a vector space V equipped 


'You may recall having met GL(n, R) at the end of Sect. 2.1. There we asked why it is not a vector 
space, and now we know — it is more properly thought of as a group! 
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with a non-degenerate Hermitian form (•!•)• In this case, we can consider the set of 
isometries Isom( V), consisting of those operators T which ‘preserve’ (• | •) in the 
sense that 


(T v\T w) = (v\w) Vv,w,eV. 


(4.5) 


If (• | •) can be interpreted as giving the ‘length’ of vectors, then an isometry T can 
be thought of as an operator that preserves lengths. 2 3 Note that any such T is invert- 
ible (why?), hence Isom(y) C GL{V). Isom(V) is in fact a subgroup of GL(V), as 
we will now verify. First off, for any T,U e Isom(V), 


{(TU)v\(TU)w) = (T{Uv)\T{Uw)) 

— ( Uv\Uw ) since T is an isometry 
= (v | w) since U is an isometry 

so TU is an isometry as well, hence Isom(V) is closed under multiplication. As 
for associativity, this axiom is automatically satisfied since I som ( V ) C GL(V) and 
multiplication in GL(V ) is associative, as we proved above. As for the existence of 
the identity, just note that the identity operator I is trivially an isometry. To verify 
the existence of inverses, note that T~ l exists and is an isometry since 

(7’ -1 i>|7’ -1 it>) = (TT~ l v\TT~ l w) since T e Isom(F) 

= (n|w). 

Thus Isom(F) is a group. Why is it of interest? Well, as we will show in the next few 
examples, the matrix representations of lsom( V) actually turn out to be the orthog- 
onal matrices, the unitary matrices, and the Lorentz transformations, depending on 
whether or not V is real or complex and whether or not (• | •) is positive-definite. Our 
discussion here shows’ that all of these sets of matrices are groups, and that they 
can all be thought of as representing linear operators which preserve the relevant 
non-degenerate Hermitian form. 


Example 4.4 The orthogonal group 0(n ) 


Let V be an n -dimensional real inner product space. The isometries of V can be 
thought of as operators which preserve lengths and angles, since the formula 


(v\w) 


i> w 


for the angle between v and w is defined purely in terms of the inner product. Now, 
if T is an isometry and we write out (i>, w) = (Tv\Tw) in components referred to 
an orthonormal basis B, we find that 


2 Hence the term ‘iso-metry’ = ‘same length’. 

3 We are glossing over some subtleties with this claim. See Example 4.17 for the full story. 
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[u] r [w] = 

= (Tv\Tw) 

= 8ij(Tvy(Tv)j 
= 8 ij T i k v k T> l w l 
= Y j V k T i k T i l w l 

i 

= [v} t [T] t [T}[w] Vi>, w, e V. (4.6) 

As you will show in Exercise 4.2, this is true if and only if [T] t [T] = /, or 

[T] t = [T]~ 1 . (4.7) 

This is the familiar orthogonality condition, and the set of all orthogonal matrices 
(which, as we know from the above discussion and from Problem 3.1, form a group) 
is known as the orthogonal group O(n). We will consider O(n ) in detail for n — 2 
and 3 in the next section. 

Exercise 4.2 Show that 


[u] r [u)] = [w] 7 ’[7’] 7 '[7’][i«;] Vt.ireV (4.8) 

if and only if [T] t \T] = I . One direction is easy; for the other, let v = a, w = e j where 
{e, is orthonormal. 

Exercise 4.3 Verify directly that 0(n) is a group, by using the defining condition (4.7). 
This is the same as Problem 3.1(b). 


Example 4.5 The unitary group U (n) 


Now let V be a complex inner product space. Recall from Problem 2.6 that we can 
define the adjoint T ' of a linear operator T by the equation 


(r 1 v|u;) = (v\T w). 


(4.9) 


If T is an isometry, then we can characterize it in terms of its adjoint, as follows: 
first, we have 


(u| w) = (T v\Tw ) 

= (7’ + ru|w) Vt >,weV. 

Calculations identical to those of Exercise 4.2 then show that this can be true if and 
only if T'r — 7 ’ 7 ’t _ / which is equivalent to the more familiar condition 



(4.10) 


Such an operator is said to be unitary. Thus every isometry of a complex inner 
product space is unitary, and vice versa. Now, if V has dimension n and we choose 
an orthonormal basis for V, then we have [T 1 ] = [7']‘ (cf. Problem 2.4), and so 
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(4.10) implies 

[r] f = [7T 1 . (4.11) 

Thus, in an orthonormal basis, a unitary operator is represented by a unitary matrix! 
(Note that this is NOT necessarily true in a non-orthonormal basis.) By the discus- 
sion preceding Example 4.4, the set of all unitary matrices forms a group, denoted 
U (n). We will not discuss U (n) in depth in this text, but we will discuss one of its 
cousins, SU( 2) (to be defined below), extensively. 

Note that there is nothing in the above discussion that requires V to be com- 
plex, so we can actually use the same definitions (of adjoints and unitarity) to define 
unitary operators on any inner product space, real or complex. Thus, a unitary op- 
erator is just an isometry of a real or complex inner product space. 4 In the case of 
a real vector space, the unitary matrix condition (4.11) reduces to the orthogonality 
condition (4.7), as you might expect. 

Exercise 4.4 Verify directly that U (n) is a group, using the defining condition (4.1 1). 


Example 4.6 The Lorentz group 0(n — 1,1) 


Now let V be a real vector space with a Minkowski metric t], which is defined, as 
in Example 2.19, as a symmetric, non-degenerate (2, 0) tensor whose matrix in an 
orthonormal basis has the form 


/I 


M = 



(4.12) 


with zeros on all the off-diagonals. This is to be compared with (2.25), which is 
just (4.12) with n = 4. Now, since q is a non-degenerate Hermitian form, we can 
consider its group of isometries. If T e Isom(V), then in analogy to the computation 
leading to (4.6), we have (in an arbitrary basis £>), 


[v] T [q][w] = q(v, w) 

= q (Tv, Tw) 

— [vY [T] T [q][T][w] Vv,weV. (4.13) 


Again, the same argument as you used in Exercise 4.2 shows that the above holds if 
and only if 

[Tf[r 1 ][T] = [r ] ] (4.14) 


4 In fact, the only reason for speaking of both “isometries” and “unitary operators” is that uni- 
tary operators act solely on inner product spaces, whereas isometries can act on spaces with 
non-degenerate Hermitian forms that are not necessarily positive-definite, such as R 4 with the 
Minkoswki metric. 
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which in components reads 


T/T v a ri pa = r ] pLV 
If B is orthonormal, (4.14) becomes 


(4.15) 


/I 

[ Tf 

V 


\ 


[T] = [r,] (4.16) 


-v 


which you will recognize from (3.37) as the definition of a Lorentz transformation, 
though now we are working in an arbitrary dimension n rather than just dimension 
four. We thus see that the set of all Lorentz transformations forms a group, known 
as the Lorentz group and denoted by 0(n — 1 , 1) (the notation just refers to the 
number of positive and negative Is present in the matrix form of i] given in (4.12)). 
The Lorentz transformations lie at the heart of special relativity, and we will take a 
close look at these matrices for n = 4 in the next section. □ 


Exercise 4.5 Verify directly that 0(n — 1, 1) is group, using the defining condition (4.14). 


Aside You may recall that we originally defined orthogonal matrices, unitary matrices, and 
Lorentz transformations as those matrices which implement a basis change from one orthonor- 
mal basis to another (on vector spaces with real inner products, Hermitian inner products, and 
Minkowski metrics, respectively). In the preceding examples, however, we have seen that these 
matrices can alternatively be defined as those which represent (in an orthonormal basis) operators 
which preserve a non-degenerate Hermitian form. These two definitions correspond to the active 
and passive viewpoints of transformations: our first definition of these matrices (as those which 
implement orthonormal basis changes) gives the passive viewpoint, while the second definition (as 
those matrices which represent isometries) gives the active viewpoint. 

Example 4.7 The special unitary and orthogonal groups SU(n ) and SO{n ) 

The groups 0(n) and U (n) have some very important subgroups, the special uni- 
tary and special orthogonal groups, denoted SU(n) and SO(n), respectively, which 
are defined as those matrices in U (n) and O(n) that have determinant equal to 1. 
You will verify below that these are subgroups of U(n) and 0{n). These groups 
are basic in mathematics and (for certain n) fundamental in physics: as we will see, 
SO(n) is the group of rotations in n dimensions, 577(2) is crucial in the theory of 
angular momentum in quantum mechanics, and though we will not discuss it here, 
SU(3) is fundamental in particle physics, especially in the mathematical description 
of quarks. 

Exercise 4.6 Show that SO(n) and SU(n) are subgroups of 0(n) and U (n). 

Before moving on to a more detailed look at some specific instances of the groups 
described above, we switch gears for a moment and consider groups that are not 
subsets of GL(n , C). These groups have a very different flavor than the groups we 
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have been considering, but are useful in physics nonetheless. We will make more 
precise the sense in which they differ from the previous examples when we get to 
Sect. 4.4. 

Example 4.8 Z 2 : The group with two elements 

Consider the set Z 2 = {+1, — 1} C Z with the product being just the usual multi- 
plication of integers. You can easily check that this is a group, in fact an abelian 
group. Though this group may seem trivial and somewhat abstract, it pops up in a 
few places in physics, as we will see in Sect. 4.3. 

Example 4.9 S n : The symmetric group on n letters 

This group does not usually occur explicitly in physics but is intimately tied to per- 
mutation symmetry, the physics of identical particles, and much of the mathematics 
we discussed in Sect. 3.8. The symmetric group on n letters (also known as the per- 
mutation group), denoted S n , is defined to be the set of all one-to-one and onto maps 
of the set {1, 2 , . . . , n) to itself, where the product is just the composition of maps. 
The maps are known as permutations . You should check that any composition of 
permutations is again a permutation and that permutations are invertible, so that S„ 
is a group. This verification is simple, and just relies on the fact that permutations 
are, by definition, one-to-one and onto. 

Any permutation a is specified by the n numbers <r(i), i = 1 and can 
conveniently be notated as 


1 2 • • • n 

ct(1 ) a (2) ••• a(n) 


n 


In such a scheme, the identity in S3 would just look like 



while the cyclic permutation o\ given byl— >-2,2— >-3,3 — > 1 would look like 



and the transposition <72 which switches 1 and 2 and leaves 3 alone would look like 



How do we take products of permutations? Well, the product o\ ■ 02 would take on 
the following values: 


(cti •cr 2 )(l) = CT 1 (CT 2 (l)) = cr 1 (2) = 3 
(tri ■ cr 2 )(2) = CD (1) = 2 
(aj ■ 02) (3) = 04(3) = 1 


(4.17) 

(4.18) 

(4.19) 
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so we have 


a\-a 2 = 


1 2 3 

2 3 1 


1 2 3 

2 1 3 


1 2 3 
3 2 1 


(4.20) 


You should take the time to inspect (4.20) and understand how to take such a product 
of permutations without having to write out (4.17)-(4.19). 

Though a proper discussion of the applications of S„ to physics must wait until 
Sect. 4.3, we point out here that if we have a vector space V and consider its n-fold 
tensor product T~(V), then 5„ acts on product states by 


er(l) 1 <g> V2 ® ® v n ) — v a (l) ® V a (2) ® • • • ® V a ( n )- 

A generic element of T^{V) will be a sum of such product states, and the action 
of a e S„ on these more general states is determined by imposing the linearity 
condition. In the case of n identical particles in quantum mechanics, where the to- 
tal Hilbert space is naively the n-fold tensor product T^('H) of the single-particle 
Hilbert space H , this action effectively interchanges particles, and we will later re- 
state the symmetrization postulate from Example 3.25 in terms of this action of S n 
on T®(Ti). 


Exercise 4.7 Show that 5„ has nl elements. 


4.2 The Groups of Classical and Quantum Physics 

We are now ready for a detailed look at some of the specific groups which arise in 
physics. 

Example 4.10 SO (2) : Special orthogonal group in two dimensions 


As discussed above, SO( 2) is the group of all orthogonal 2x2 matrices with deter- 
minant equal to 1. You will check in Exercise 4.8 that 5(9(2) is abelian and that the 
general form of an element of 5(9(2) is 


( cost? — sin (9 
sin0 cos0 


(4.21) 


You will recognize that such a matrix represents a counterclockwise rotation of 6 
radians in the x—y plane. Though we will not discuss 5(9(2) very much, it serves 
as a nice warmup for the next example, which is ubiquitous in physics and will be 
discussed throughout the text. 


Exercise 4.8 Consider an arbitrary matrix 



and impose the orthogonality condition, as well as | A\ = 1. Show that (4.21) is the most gen- 
eral solution to these constraints. Then, verify explicitly that 50(2) is a group (even though 
we already know it is by Exercise 4.6) by showing that the product of two matrices of the 
form (4.21) is again a matrix of the form (4.21). This will also show that 50(2) is abelian. 
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Example 4.11 SO (3): Special orthogonal group in three dimensions 

This group is of great importance in physics, as it is the group of all rotations in 
three-dimensional space! For that statement to mean anything, however, we must 
carefully define what a ‘rotation’ is. One commonly used definition is the following: 

Definition A rotation in n dimensions is any linear operator R which can be ob- 
tained continuously from the identity 5 and takes orthonormal bases to orthonormal 
bases, i.e. for any orthonormal basis {e,-},- = i „, j Ret |, = i „ must also be an or- 
thonormal basis. 

You will show in Problem 4.1 that this definition is equivalent to saying R e 
SO(n). 

Given that SO( 3) really is the group of three-dimensional rotations, then, can 
we find a general form for an element of SO( 3)7 As you may know from classical 
mechanics courses, an arbitrary rotation can be described in terms of the Euler 
angles, which tell us how to rotate a given orthonormal basis into another of the 
same orientation (or handedness). In classical mechanics texts, 6 it is shown that this 
can be achieved by rotating the given axes by an angle 0 around the original z-axis, 
then by an angle 6 around the new x-axis, and finally by an angle 0 around the new 
Z-axis. If we take the passive point of view, these three rotations take the form 

( cost j/ sin0 0\ /l 0 0 \ / cos 0 sin 0 0\ 

— sini jr cos0 0 I , I 0 cos0 sin0 I , I — sin0 cos 0 0 I 

0 0 1/ \0 — sin0 cos<9/ \ 0 0 1 / 

(4.22) 

so multiplying them together gives a general form for R e SO( 3): 

( cos 0 cos 0 — cos 9 sin 0 sin i jr cos 0 sin 0 + cos 9 cos 0 sin 0 

— sin 0 cos 0 — cos 6 sin 0 cos 0 — sin 0 sin 0 + cos 6 cos 0 cos 0 

sin 9 sin 0 — sin 0 cos 0 

Another general form for R e SO (3) is that of a rotation by an arbitrary angle 0 
about an arbitrary axis ii; you will see in Sect. 4.6 that this is given by 

( n\{\ — cost?) + cos# n x n y {l — cosf?) — n z sin(9 n. v /ir(l — cost?) + n y sin# ^ 

n y n x (\ — cosG) + n z sin(? /7?(1 — cost?) + cos9 n y n z (l — cosd) — n x sin0 
\ n z n xO ~ cos0) — tiy sin0 n z n y (l — cos8) + n x s'm9 n \0 — cos 9) + cosd y 

(4.24) 


sin 0 sin 9 
cos 0 sinS 

COS0 

(4.23) 


5 Meaning that there exists a continuous map y : [0, 1] — > GL(n,R) such that y(0) = I and 
y (1) = R. In other words, there is a path of invertible matrices connecting R to /. 

6 Such as Goldstein [6], 
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where n = (n x , n y , n z ) and the components of n are not all independent since n 2 + 
n 2 +n 2 = 1 . This constraint, along with the three components of n and the angle 0, 
then give us three free parameters with which to describe an arbitrary rotation, just 
as with the Euler angles. For a nice geometric interpretation of the above matrix, see 
Problem 4.3. 

Example 4.12 0(3): Orthogonal group in 3 dimensions 

If SO( 3) is the group of all three-dimensional rotations, then what are we to make 
of 0(3), the group of all orthogonal 3x3 matrices without the restriction on the 
determinant? Well, as we pointed out in Example 3.27, the orthogonality condition 
actually implies 7 that \R\ = ±1, so in going from SO( 3) to 0(3) we are just adding 
all the orthogonal matrices with | R\ = — 1 . These new matrices are sometimes re- 
ferred to as improper rotations, as opposed to the elements with Aj = I which are 
known as proper rotations. Now, amongst the improper rotations is our old friend 
the inversion transformation, —I, which should be familiar from Example 3.27. 
Any improper rotation can be written as the product of a proper rotation and the 
inversion transformation, as R — (—I)(—R) (note that if R is an improper rotation, 
then — R is a proper rotation). Thus, an improper rotation can be thought of as a 
proper rotation followed 8 by the inversion transformation. One important feature of 
0(3) is that its two parts, the proper and improper rotations, are disconnected , in 
the sense that one cannot continuously go from matrices with \R\ = 1 to matrices 
with \R\ = — 1 (if one can continuously go from one group element to any other, 
then the group is said to be connected. It is disconnected if it is not connected). 
One can, however, multiply by — I to go between the two components. This is rep- 
resented schematically in Fig. 4.1. Note that the stipulation in our definition that a 
rotation must be continuously obtainable from the identity excludes all the improper 
rotations, as it should. 

Example 4.13 SU(2): Special unitary group in two complex dimensions 

As mentioned in Example 4.7, SU{2) is the group of all 2 x 2 complex matrices A 
which satisfy | A | = 1 and 

At = A _1 . 

You can check (see Exercise 4.9 below) that a generic element of SU(2) looks like 

f), a,PeC, \a\ 2 + \/3\ 2 = l. (4.25) 


7 This fact can be understood geometrically: since orthogonal matrices preserve distances and an- 
gles, they should preserve volumes as well. As we learned in Example 3.27, the determinant mea- 
sures how volume changes under the action of a linear operator, so any volume preserving operator 
should have determinant ± 1 . The sign is determined by whether or not the orientation is reversed. 

8 One can actually think of the inversion as following or preceding the proper rotation, since —I 
commutes with all matrices. 
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-I 


-> 



proper rotations 


improper rotations 


Fig. 4.1 The two components of 0(3). The proper rotations are just 50(3). Multiplying by the 
inversion transformation — I takes one back and forth between the two components 

We could also use three real parameters with no conditions rather than two complex 
parameters with a constraint; one such parametrization is 



(4.26) 


where we have used the same symbols for our parameters as we did for the Euler 
angles. This is no accident, as there is a close relationship between 5(7 (2) and 50(3), 
which we will discuss in detail in the next section. This relationship underlies the 
appearance of 5(7 ( 2) in quantum mechanics, where rotations are implemented on 
spin 1 /2 particles by elements of 5(7(2), and in fact a rotation with Euler angles </;, 
0 and i jr is implemented by the matrix (4.26) ! 

Exercise 4.9 Consider an arbitrary complex matrix 



and impose the unit determinant and unitary conditions. Show that (4.25) is the most general 
solution to these constraints. Then show that any such solution can also be written in the 
form (4.26). 

Example 4.14 50(3, 1)„: The restricted Lorentz group 

The restricted Lorentz group 50(3, 1 ) 0 is defined to be the set of all A e 0(3, I ) 
which satisfy |A| = 1 as well as A 44 > 1. You will verify in Problem 4.4 that 
50(3, 1 ) 0 is a subgroup of 0(3, 1). Where does its definition come from? Well, just 
as O (3) can be interpreted physically as the set of all orthonormal coordinate trans- 
formations, 0(3, 1) can be interpreted as the set of all transformations between iner- 
tial reference frames. However, we often are interested in restricting those changes 
of reference frame to those which preserve the orientation of time and space, which 
is what the additional conditions |A| = 1 and A 44 > 1 do. The condition A 44 > 0 
means that A does not reverse the direction of time (so that clocks in the new coor- 
dinates are not running backwards) and this, together with |A| = 1, implies that A 
does not reverse the orientation of the space axes. Such transformations are known 
as restricted Lorentz transformations. 


100 


4 Groups, Lie Groups, and Lie Algebras 


The most familiar such transformation is probably 


/I 

0 

0 

0 \ 

0 

1 

0 

0 

0 

0 

Y 

-Py 


0 

~P Y 

Y / 


-1 <ft < 1 , 


Y = 


1 

7^' 


(4.27) 


This is interpreted passively 9 10 as a coordinate transformation to a new reference 
frame that is unrotated relative to the old frame but is moving uniformly along the z- 
axis with relative velocity ft Such a transformation is often referred to as a boost 
along the z-axis, and is also sometimes written as 


/ 1 0 0 0 \ 
0 10 0 
0 0 cosh u — sinh u 

y0 0 — sinhn coshw j 


u e R. 


(4.28) 


where u is a quantity known as the rapidity and is related to ft by tanh a — ft. 
(You should check that the above matrices really are elements of SO( 3, 1) 0 .) We 
could also boost along any other spatial direction; if the relative velocity vector is 
ft = ( ft x , fty , ft, ) , then the corresponding matrix should be obtainable from (4.27) 
by an orthogonal similarity transformation that takes z into ft. You will show in 
Exercise 4.10 below that this yields 


/ft?(y-i) i i PxPyiy - 0 Aftty-i) a ,.\ 

fi 2 “1“ 1 fl 2 fl 2 PxY 


L = 


PyPAv- 1 ) Py(y-t) , , 

fl2 + 1 


^ 4 ^ ~PyY 


fcfe(y-D 

a 2 

V -PxY 


PzPytY- 1) Plt.Y- 1) , , 

R2 fl2 ”T 1 


P 2 
~PyY 


-PzY 


PzY 
Y ) 


(4.29) 


If we generalize the relation between u and ft to three dimensions as 


tanhn 

ft = u, u — |u| 

u 


(4.30) 


9 It is worth noting that, in contrast to rotations, Lorentz transformations are pretty much always 
inteipreted passively. A vector in R 4 is considered an event, and it does not make much sense 
to start moving that event around in spacetime (the active interpretation), though it does make 
sense to ask what a different observer’s coordinates for that particular event would be (passive 
inteipretation). 

10 Note that ft is measured in units of the speed of light, hence the restriction — 1 < ft < 1. 
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then you can check that this arbitrary boost can also be written as 


/ «J(cosh«-l) , , UxUy (cosh it 1 ) M x « z (cosh«-l) 
— 22 r t " 22 22 


L = 


Uyiix (cosh u — 1 ) M^(coshu-l) 


+ 1 


w^H-(coshw— 1) 


u^u x (cosh u — 1) iiviiy fcosh u 1) «j(cosh«-l) t , 

u 2 u 2 M 2 + 1 

\ — ^ sinh u — yf sinh u — sinh n 

' tt u Li 


- — sinh m ^ 

U 

- — sinh u 

U 

- — sinh u 

U 

cosh u / 


(4.31) 


Now we know what boosts look like, but how about an arbitrary restricted 
Lorentz transformation? Well, the nice thing about 5(3(3, l) c is that any element A 
can be decomposed as A = LR' , where 


R’ = 



R e 5(3(3) 


(4.32) 


and L is of the form (4.29). This is the usual decomposition of an arbitrary restricted 
Lorentz transformation into a rotation and a boost, which you will perform in Prob- 
lem 4.5. Note that L has three arbitrary parameters, so that our arbitrary restricted 
Lorentz transformation LR' has six parameters total. 


Exercise 4.10 Construct an orthogonal matrix R which implements an orthonormal change 
of basis from the standard basis {x, y, z} to one of the form {L, ? 2 , fl] where the f; are any 
two vectors mutually orthonormal with ji and each other. Embed R in 50(3, 1) D as in (4.32) 
and use this to obtain (4.29) by performing a similarity transformation on (4.27). Parts of 
Problem 3.1 may be useful here. 

Exercise 4.11 Check that L in (4.29) really does represent a boost of velocity fi as follows: 
Use L as a passive transformation to obtain new coordinates (x\ y\z ! , t') from the old ones 
by 


(x'\ 


(x\ 

/ 

= L 

y 

z' 


z 

VV 


W 


Show that the spatial origin of the unprimed frame, defined by x = y = z = 0, moves with 
velocity —fi in the primed coordinate system, which tells us that the primed coordinate 
system moves with velocity +fi with respect to the unprimed system. 


Example 4.15 (3(3, 1): The extended Lorentz group 


In the previous example we restricted our changes of inertial reference frame to 
those which preserved the orientation of space and time. This is sufficient in classical 
mechanics, but in quantum mechanics we are often interested in the effects of space 
and time inversion on the various Hilbert spaces we are working with. If we add 
spatial inversion, also called parity and represented by the matrix 


/-I 0 0 0\ 

0-100 
0 0-10 
v 0 0 0 \) 


(4.33) 
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as well as time-reversal, represented by 

/I 0 0 0 \ 

010 0 
0 0 10 
v 0 0 0 — l j 


(4.34) 


to the restricted Lorentz group, we actually recover 0(3, 1), which is thus known 
as the improper or extended Lorentz group. You should verify that P, T e 0(3, 1), 
but P,T SO( 3, l) c . In fact, |P| = |7| = — 1, which is no accident; as in the case 
of the orthogonal group, the defining equation (4.14) restricts the determinant, and 
in fact implies that |A| = ±1. In this case, however, the group has four connected 
components instead of two! Obviously those matrices with |A| = 1 must be dis- 
connected from those with | A| = — 1, but those which reverse the orientation of the 
space axes must also be disconnected from those which do not, and those which 
reverse the orientation of time must be disconnected from those which do not. This 
is represented schematically in Fig. 4.2. Note that, as in the case of 0(3), multi- 
plication by the transformations P and T take us to and from the various different 
components. 


Example 4.16 SL( 2, C): Special linear group in two complex dimensions 


This group cannot be viewed as a group of isometries, but it is important in physics 
nonetheless. SL( 2, C) is defined to be the set of all 2 x 2 complex matrices A with 
| A | = 1. By now it should be obvious that this set is a group. The general form of 
A e SL( 2, C) is 


A = 



a, b, c, d e C, ad — be = 1. 


The unit determinant constraint means that A is determined by three complex pa- 
rameters or six real parameters, just as for 519(3, l) a . This is no coincidence; in fact, 
SL( 2, C) bears the same relationship to 5(9(3, 1) 0 as 5(7(2) bears to 5(9(3), in that 
SL(2, C) implements restricted Lorentz transformations on spin 1/2 particles! This 
will be discussed in the next section. You will also show later 11 that a boost with 
rapidity u is implemented by an SL( 2, C) matrix of the form 


/ cosh ^ + y sinh | — ^ (u x — iu y ) sinh (j 

y — ^ (u x + iiiy) sinh f cosh | — 7/ sinh | 


ueR 3 . (4.35) 


It can also be shown, 12 just as for 5(9(3, 1) 0 , that any A e SL( 2, C) can be decom- 
posed as A = LR, where R e 5(7(2) and L is as above. This, together with the 
facts that an arbitrary rotation can be implemented by R e 5(7(2) parametrized as in 
(4.26), yields the general form LR for an element of SL( 2, C) in terms of the same 
parameters we used for 5(9(3, 1) G . 


11 See Problem 4.8. 
12 See Problem 4.6. 
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Fig. 4.2 The four components of 0(3, 1). The proper Lorentz transformations in the upper-left 
corner are just 50(3, l) a . Note that the transformations in the lower-right hand comer change both 
the orientation of the space axes as well as time, and so must be disconnected from 50(3, 1)„ even 
though they have A = 1 . Also note that multiplying by the parity and time-reversal operators P 
and T take one back and forth between the various components 


4.3 Homomorphism and Isomorphism 

In the last section we claimed that there is a close relationship between SU( 2) and 
S0( 3), as well as between SL( 2, C) and 50(3, 1) G . We now make this relationship 
precise, and show that a similar relationship exists between S n and Z 2 . We will also 
define what it means for two groups to be ‘the same’, which will then tie into our 
somewhat abstract discussion of Z 2 in the last section. 

Given two groups G and //, a homomorphism from G to H is a map <t> : G If 
such that 


^(5152) = 4 >(gi) G>(g 2 ) Vgi,g 2 eG. 


(4.36) 


Note that the product in the left hand side of (4.36) takes place in G, whereas the 
product on the right hand side takes place in H . A homomorphism should be thought 
of as a map from one group to another which preserves the multiplicative structure. 
Note that <J> need not be one-to-one or onto; if it is onto then <t> is said to be a 
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homomorphism onto H, and if in addition it is one-to-one then we say <1> is an 
isomorphism. If $ is an isomorphism then it is invertible and thus sets up a one-to- 
one correspondence which preserves the group structure, so we regard G and H as 
‘the same’ group, just with different labels for the elements. When two groups G 
and H are isomorphic we write G — H . 

Exercise 4.12 Let <L : G — > H be a homomorphism, and let e be the identity in G and e' 
the identity in H . Show that 


O(e) = e' 

v S eG - 

Example 4.17 Isometries and the orthogonal, unitary, and Lorentz groups 

A nice example of a group isomorphism is when we have an n -dimensional vector 
space V (over some scalars C) and a basis B, hence a map 

GL(V) — »■ GL(n, C) 

T^[T] b . 

It is easily checked that this map is one-to-one and onto. Furthermore, it is a homo- 
morphism since [TU] = [T][!/], a fact you proved in Exercise 2.8. Thus this map is 
an isomorphism and GL(V ) ~ GL(n, C). If E has a non-degenerate Hermitian form 
(• | ■) then we can restrict this map to Isom(E) C GL(V), which yields 

Isom(E) ~ 0(n ) 

when V is real and (• | ■) is positive-definite, 

Isom(V) ~ U («) 

when V is complex and (• | ■) is positive-definite, and 

Isom(V) ~ 0(n — 1, 1) 

when V is real and (• | -) is a Minkoswki metric. These isomorphisms were implicit 
in the discussion of Examples 4. 3-4. 6, where we identified the operators in Isom(E) 
with their matrix representations in the corresponding matrix group. □ 

Example 4.18 Linear maps as homomorphisms 

A linear map from a vector space V to a vector space IV is a map <t> : V — > VV that 
satisfies the usual linearity condition 

0(cui + V2) = C4>(i>i) + <I>(i>2). (4.37) 

(A linear operator is then just the special case in which V = W.) In particular we 
have 4>(i>i + V 2 ) = <£>(vi) + O^), which just says that O is a homomorphism 
between the additive groups V and IV ! (cf. Example 4.2). If <t> is one-to-one and 
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onto then it is an isomorphism, and in particular we refer to it as a vector space 
isomorphism. 

The notions of linear map and vector space isomorphism are basic ones, and 
could have been introduced much earlier (as they are in standard linear algebra 
texts). Because of our specific goals in this book, however, we have not needed them 
yet. These objects will start to play a role soon, though, and will recur throughout 
the rest of the book. 

Exercise 4.13 Use an argument similar to that of Exercise 2.6 to prove that a linear map 
0 : V — > W is an isomorphism if and only if dim V = dim W and 0 satisfies 

0(u) = O => v = 0. 

Exercise 4.14 Before moving on to more complicated examples, let us get some practice 
by acquainting ourselves with a few more basic homomorphisms. First, show that the map 

exp : R -»• R* 
x m- e x , 

from the additive group of real numbers to the multiplicative group of nonzero real num- 
bers, is a homomorphism. Is it an isomorphism? Why or why not? Repeat the analysis for 
exp : C — > C* . Also, show that the map 

det : GL(n, C) — > C* 

A h-> det A 

is a homomorphism for both C = R and C = C. Is it an isomorphism in either case? Would 
you expect it to be? 

Exercise 4.15 Recall that U (1) is the group of 1 x 1 unitary matrices. Show that this is just 
the set of complex numbers z with |z| = 1, and that 1/(1) is isomorphic to 50(2). 

Suppose <J> is a homomorphism but not one-to-one. Is there a way to quantify 
how far it is from being one-to-one? Define the kernel of <f> to be the set K = 
{g e G | O(g) = e'j, where e' is the identity in H . In other words, K is the set of all 
elements of G that get sent to e ' under <t>. If O is one-to-one, then K = {e}, since 
there can be only one element in G that maps to e ' and by Exercise 4.12 that is e. If 
<J> is not one-to-one, then the size of K tells us how far it is from being so. Also, if 
we have <l>(gi) = Ofe) = h e H, then 

^(gi#^ ’) = = hh~ l = e 

so gig^ 1 is in the kernel of <t>, i.e. gig^ 1 = k e K. Multiplying this on the right 
by g 2 then gives g i = kgj, so we see that any two elements of G that give the 
same element of H under <t> are related by left multiplication by an element of K . 
Conversely, if we are given g e G and <!>(<?) = h, then for all k e K, 

<t>(kg) = <P(k)®(g) = e'<t >(g) = 0(g) = h. 

Thus if we define Kg = {kg \ k e K}, then Kg are precisely those elements (no 
more, and no less) of G which get sent to h. Thus the size of K tells us how far <t> 
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is from being one-to-one, and the elements of K tell us exactly which elements of 
G will map to a specific element h e H. 

Homomorphisms and isomorphisms are ubiquitous in mathematics and occur 
frequently in physics, as we will see in the examples below. 

Exercise 4.16 Show that the kernel K of any homomorphism O : G — > H is a subgroup 
of G. Then determine the kernels of the maps exp and det of Exercise 4.14. 

Exercise 4.17 Suppose <t> : V — > W is a linear map between vector spaces, hence a homo- 
morphism between abelian groups. Conclude from the previous exercise that the kernel K 
of O is a sub space of V, also known as the null space of <t>. The dimension of K is known 
as the nullity of K . Also show that the range of <t> is a subspace of W , whose dimension is 
known as the rank of <t>. Finally, prove the rank-nullity theorem of linear algebra, which 
states that 


rank(<J>) + nullity(O) = dim V. 


(4.38) 


(Hint: Take a basis {ej , . . . , e *} for K and complete it to a basis {e\, . . . ,e n } for V, where 
n = dim V. Then show that {<t>(e£ + i) <t>(c„)} is a basis for the range of O.) 

Example 4.19 SU( 2) and 50( 3) 

In most physics textbooks the relationship between 50(3) and SU( 2) is described 
in terms of the ‘infinitesimal generators’ of these groups. We will discuss infinitesi- 
mal transformations in the next section and make contact with the standard physics 
presentation then; here we present the relationship in terms of a group homomor- 
phism p : 50(2) — >• 50(3), defined as follows: consider the vector space (check!) 
of all 2 x 2 traceless anti-Hermitian matrices, denoted as su(2) (for reasons we will 
explain later). You can check that an arbitrary element X e su(2) can be written as 



(4.39) 


If we take as basis vectors 





then we have 


X — xS x + yS y + zS z 


so the column vector corresponding to X in the basis B = { 5 A , 5 V , 5- } is 
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Note that 

det X — -j- (x 2 + y 2 + z 2 ) — ^ || [X] 1 ~ 

so the determinant of X e su(2) is proportional to the norm squared of [X] e R 3 
with the usual Euclidean metric. Now, you will check below that A e SU( 2) acts on 
X e su(2) by the map X i->- AX A ’ , and that this map is linear. Thus, this map is a 
linear operator on su(2), and can be represented in the basis B by a 3 x 3 matrix 
which we will call p(A), so that [AXA 4 ] = p(A)[X ] where p(A) acts on [X] by 
the usual matrix multiplication. Furthermore, 

||p(A)[X]|| 2 = ||[AXA t ]|| 2 = 4det(AXA t ) = 4detX = ||[X]|j 2 (4.40) 

so p(A) preserves the norm of X. This implies (see Exercise 4.19 below) that 
p(A) e 0(3), and one can in fact show 13 that detp(A) = 1, so that p(A) e SO( 3). 
Thus we may construct a map 

p : 51/(2) -» 50(3) 

Ah^p(A). 

Furthermore, p is a homomorphism, since 

p(AB)[X] = [(Afi)X(Afi) 1 '] = [A5Xfi t A t ] = p(A)[BXB l ] 

— p{A)p(B)[X] (4.41) 

and hence p(AB) — p(A)p(B). Is p an isomorphism? One can show 14 that p is 
onto but not one-to-one, and in fact has kernel K — {I, — I}. From the discussion 
preceding this example, we then know that p(A) = p(— A) VA e 50(2) (this fact 
is also clear from the definition of p), so for every rotation R e 50(3) there corre- 
spond exactly two matrices in 50(2) which map to R under p. Thus, when trying 
to implement a rotation R on a spin 1 / 2 particle we have two choices for the 50(2) 
matrix we use, and it is sometimes said that the map p _1 from 50(3) to 50(2) is 
double-valued. In mathematical terms one does not usually speak of functions with 
multiple-values, though, so instead we say that 50(2) is the double cover of 50(3), 
since the map p is onto (‘cover’) and two-to-one (‘double’). 


13 See Problem 4.7 of this chapter, or consider the following rough (but correct) argument: 
p : SU( 2) — 0(3) as defined above is a continuous map, and so the composition 

det o p : SU{ 2) — > R 

A m- det(p(A)) 

is also continuous. Since SU(2) is connected any continuous function must itself vary continuously, 
so det o p cannot jump between 1 and — 1 , which are its only possible values. Since det(p(/)) = 1, 
we can then conclude that det(p(A)) = 1 VA e SU( 2). 

14 See Problem 4.7 again. 
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Exercise 4.18 Let A G 517(2), X G su(2). Show that AXA t G su(2) and note that 
A(X + Y)A Jf = AXA^ + AYA\ so that the map X — > AXA ’ really is a linear operator 
on su(2). 

Exercise 4.19 Let V be a real vector space with a metric g and let R G C(V) preserve 
nonns on V, i.e. g(Rv , Rv) = g(v, v) Vu G V . Show that this implies that 

g(Rv, Rui) = g(v, w) Vv.weV, 

i.e. that R is an isometry. Hint: consider g(v + t v,v + w) and use the bilinearity of g. 


Example 4.20 SL( 2, C) and SO( 3, 1) G 


Just as there is a two-to-one homomorphism from 5(7(2) to 5(9(3), there is a two-to- 
one homomorphism from SL( 2, C) to 5(9(3, 1 ) c which is defined similarly. Consider 
the vector space // 2 (C) of 2 x 2 Hermitian matrices. As we saw in Example 2.9, this 
is a four-dimensional vector space with basis B — {a x ,<y y ,a z , I), and an arbitrary 
element X e Hi (C) can be written as 


so that 


X = 


( t + z 
+ iy 


x - 
t - 


iy 

- z 


— xa x + ya y +Z(T z + tI 


fx\ 


[X] = 


y 

z 


VJ 


(4.42) 


Now, SL(2, C) acts on // 2 (C) in the same way that SU(2) acts on su(2): by sending 
X -* AXA ' where A e SL( 2, C). You can again check that this is actually a linear 
map from // 2 (C) to itself, and hence can be represented in components by a matrix 
which we will again call p(A). You can also check that 


det X = r - x 2 - y 2 - z 2 = ~r]([X], [X]) 

so the determinant of X e // 2 (C) gives minus the norm squared of [X] in the 
Minkowski metric on R 4 . As before, the action of p(A) on [X] preserves this 
norm (by a calculation identical to (4.40)), and you will show in Problem 4.8 that 
detp(A) = 1 and p(A) 44 > 1, so p(A) e 5(9(3, 1) G . Thus we can again construct a 
map 


p : SL( 2, C) -* 5(9(3, 1 ) 0 
A i->- p(A) 

and it can be shown 15 that p is onto. Furthermore, p is a homomorphism, by a 
calculation identical to (4.41). The kernel of p is again K — {/, — 1} so SL( 2, C) is 
a double-cover of 5(9(3, 1) G . 


15 See Problem 4.8 again. 
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Exercise 4.20 The elements of SL( 2, C) that correspond to rotations should fix timelike 
vectors, i.e. leave them unchanged. Identify the elements of Hj(C) that correspond to time- 
like vectors, and show that it is precisely the subgroup 517(2) C SL( 2, C) which leaves them 
unchanged, as expected. 

Example 4.21 Z 2 , parity and time-reversal 

Consider the set {/, P} C 0(3, 1). This is an abelian two-element group with 
P 2 = /, and so looks just like Zt. In fact, if we define a map <!> : {/, P} -> Zt 
by 

<*>(/) = 1 
<b(P) = - 1 

then $ is a homomorphism since 

< D(P ■ P ) = 0(7) = 1 = (-1) 2 = <D(7)<D(P). 

4> is also clearly one-to-one and onto, so $ is in fact an isomorphism! We could also 
consider the two-element group {/, T } e 0(3, 1); since T 2 = I , we could define a 
similar isomorphism from {/, T } to Z 2 . Thus, 

Z 2 ~{I,P}~{I,T}. 

In fact, you will show below that all two-element groups are isomorphic. That is 
why we chose to present Z 2 somewhat abstractly; there are many groups in physics 
that are isomorphic to Z 2 , so it makes sense to use an abstract formulation so that 
we can talk about all of them at the same time without having to think in terms of 
a particular representation like {/, I 1 } or {/, T } . We will see the advantage of this in 
the next example. 

Exercise 4.21 Show that any group G with only two elements e and g must be isomorphic 
to 7j2 . To do this you must define a map : G —>■ Z 2 which is one-to-one, onto, and which 
satisfies (4.36). Note that S 2 , the symmetric group on two letters, has only two elements. 
What is the element that corresponds to — 1 e Z 2 ? 

Example 4.22 S n . Z 2 , and the sgn homomorphism 

In Sect. 3.8 we discussed rearrangements, transpositions, and the evenness or odd- 
ness of a rearrangement in terms of the number of transpositions needed to obtain 
it. We are now in a position to make this much more precise, which will facilitate 
neater descriptions of the e tensor, the determinant of a matrix, and the symmetriza- 
tion postulate. 

We formally define a transposition in S n to be any permutation a which switches 
two numbers i and j and leaves all the others alone. You can check that en and cti • 02 
from Example 4.9 are both transpositions. It is a fact that any permutation can be 
written (non-uniquely) as a product of transpositions, though we will not prove this 
here. 16 As a simple example, though, you can check that 


16 See Herstein [91 for a proof and nice discussion. 
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(\ 2 3\ (\ 2 3\/l 2 3\ 

3 1 y y 3 2 lj\2 1 3 J 

is a decomposition of a j into transpositions. 

Though the decomposition of a given permutation is far from unique (for in- 
stance, the identity can be decomposed as a product of any transposition a and its 
inverse), the evenness or oddness of the number of transpositions in a decomposition 
is invariant. For instance, even though we could write the identity as 


e = e 


e — g\o y 

(72(^2 

and so on for any transpositions o\, 02 , . . ., every decomposition will consist of an 
even number of transpositions. Likewise, any decomposition of the transposition 
(^ 13 ) w '^ consist of an odd number of transpositions. A general proof of this fact 
is relegated to Problem 4.9. What this allows us to do, though, is define a homomor- 
phism sgn: S„ — > Z 2 by 


sgn(ff) 


• 1 if a consists of an even number of transpositions 
— 1 if a consists of an odd number of transpositions. 


You should be able to verify with a moment’s thought that 


sgn(cri<72) = sgn(eri ) sgn(<7 2 ), 


so that sgn is actually a homomorphism. If sgn(cr) = +1 we say a is even, and if 
sgn(cr) = — 1 we say a is odd. 

With the sgn homomorphism in hand we can now tidy up several definitions 
from Sect. 3.8. First of all, we can now define the wedge product of r dual vectors 
fi , i = 1 , . . . , r to be 


/l A ■ ■ • A fr = ^2 Sg n ( CT )/<x(l) ® fa( 2) ® ® fair)- (4.43) 

<T &Sf- 


You should compare this with the earlier definition and convince yourself that the 
two definitions are equivalent. Also, from our earlier definition of the e tensor it 
should be clear that the nonzero components of e are 


= sgn((T ) where cr = 
and so the definition of the determinant, (3.85), becomes 
| A| = sgn(er)Ai CT( i) . . . 

Ancr(n) • 

cr€S n 



(4.44) 


(4.45) 


Finally, at the end of Example 4.9 we described how S n acts on an n-fold tensor 
product T°(V) by 


CT(Ul ® V2 ® • • ' 0 1V) = IV(1) ® IV(2) ® • • • ® V a (n), 


(4.46) 
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where the action of a on a generic element on T® ( V ) (which would be a sum 
of product states like the above) is determined by linearity. If we have a totally 
symmetric tensor T — T l '- ln e^ ® • • • ® € 5”(V), we then have 

a(T) = T n - ln e a (i l) ® • • • ® e a{in) 

= T a ~ l(h) ~ a ~ l{jn) e h ® • • • ® 
where we have relabeled indices using = cr(/^) 

— jh ® • • • ® by total symmetry of 7’- 71 '"- 7 " 

= T 

so all elements of S n ( V ) are fixed by the action of S„. If we now consider a totally 
antisymmetric tensor T — T n - ln ei 1 ® • • • ® e,„ e A"(V), then the action of a e S n 
on it is given by 

a(T) = T n ' ln e rT (i l ) ® • • • ® e a(in) 

= r «T- 1 o 1 )-<T- 1 0„)^. i ®...®^ 

= sgn(a)T J< "' J "ej t ® • • • ® ej n by antisymmetry of T Jl ' Jn 
= sgn (a)T 

so if a is odd then T changes sign under it, and if a is even then T is invariant. 
Thus, we can restate the symmetrization postulate as follows: any state of an n- 
particle system is either invariant under a permutation of the particles (in which 
case the particles are known as bosons), or changes sign depending on whether the 
permutation is even or odd (in which case the particles are known as fermions). 


4.4 From Lie Groups to Lie Algebras 

You may have noticed that the examples of groups we met in the last two sec- 
tions had a couple of different flavors: there were the matrix groups like 5(7(2) and 
50(3) which were parametrizable by a certain number of real parameters, and then 
there were the ‘discrete’ groups like Z 2 and the symmetric groups 5„ that had a 
finite number of elements and were described by discrete labels rather than contin- 
uous parameters. The first type of group forms an extremely important subclass of 
groups known as Lie Groups, named after the Norwegian mathematician Sophus 
Lie who was among the first to study them systematically in the late 1800s. Besides 
their ubiquity in math and physics, Lie groups are important because their contin- 
uous nature means that we can study group elements that are ‘infinitely close’ to 
the identity; these are known to physicists as the ‘infinitesimal transformations’ or 
‘generators’ of the group, and to mathematicians as the Lie algebra of the group. 
As we make this notion precise, we will see that Lie algebras are vector spaces and 
as such are sometimes simpler to understand than the ‘finite’ group elements. Also, 
the ‘generators’ in some Lie algebras are taken in quantum mechanics to represent 
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certain physical observables, and in fact almost all observables can be built out of el- 
ements of certain Lie algebras. We will see that many familiar objects and structures 
in physics can be understood in terms of Lie algebras. 

Before we can study Lie algebras, however, we should make precise what we 
mean by a Lie group. Here we run into a snag, because the proper and most general 
debnition requires machinery well outside the scope of this text. 17 We do wish to be 
precise, though, so we follow Hall [8] and use a restricted definition. This definition 
does not really capture the essence of what a Lie Group is, but will get the job done 
and allow us to discuss Lie algebras without having to wave our hands. 

That said, we define a matrix Lie group to be a subgroup G C GL(n , C) which 
is closed , in the following sense: for any sequence of matrices A„ e G which con- 
verges to a limit matrix A, either A e G or A ^ GL(n, C). This says that a limit of 
matrices in G must either itself be in G, or otherwise be noninvertible. As remarked 
above, this definition is technical and does not provide much insight into what a Lie 
group really is, but it will provide the necessary hypotheses in proving the essential 
properties of Lie algebras. 

Let us now prove that some of the groups we have encountered above are indeed 
matrix Lie groups. We will verify this explicitly for one class of groups, the orthog- 
onal groups, and leave the rest as problems for you. The orthogonal group 0(n) is 
defined by the equation R~ l = R T , or R 1 R — I . Let us consider the function from 
GL(n,M) to itself defined by /(A) = A T A. Each entry of the matrix /(A) is easily 
seen to be a continuous function of the entries of A, so / is continuous. Consider 
now a sequence Rj in O ( n ) that converges to some limit matrix R. We then have 



= lim f(Ri) since / is continuous 


= lim I 


= I 


so R e 0{n). Thus O(n) is a matrix Lie group. The unitary and Lorentz groups, 
as well as their cousins with unit determinant, are similarly defined by continuous 
functions, and can analogously be shown to be matrix Lie groups. For an example 
of a subgroup of GL(n, C) which is not closed, hence not a matrix Lie group, see 
Problem 4.10. 

We remarked earlier, though, that the above definition does not really capture 
the essence of what a Lie group is. What is that essence? As mentioned before, one 
should think of Lie groups as groups which can be parametrized in terms of a certain 
number of real variables. This number is known as the dimension of the Lie group, 
and we will see that this number is also the usual (vector space) dimension of its 
corresponding Lie algebra. Since a Lie group is parametrizable, we can think of it as 


17 The necessary machinery being the theory of differentiable manifolds; in this context, a Lie 
group is essentially a group that is also a differentiable manifold. See Schutz [15] or Frankel [4] 
for very readable introductions for physicists, and Warner [18] for a systematic but terse account. 
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a kind of multi-dimensional space (like the surface of a sphere) that also has a group 
structure. Unlike the surface of a sphere, though, a Lie group has a distinguished 
point, the identity e, and as we mentioned above, studying transformations ‘close 
to’ the identity will lead us to Lie algebras. 

For the sake of completeness, we should point out here that there are Lie groups 
out there which are not matrix Lie groups, i.e. which cannot be described as a subset 
of GL(n, C) for some n. Their relevance for basic physics has not been established, 
however, so we do not consider them here. 18 

Now that we have a better sense of what Lie groups are, we would like to zoom 
in to Lie algebras by considering group elements that are ‘close’ to the identity. For 
concreteness consider the rotation group SO( 3). An arbitrary rotation about the z 
axis looks like 



If we take our rotation angle to be e <£ 1 , we can approximate R- (e) by expanding 
to first order, which yields 



(4.47) 


where you may recognize 



from (2.16), up to a factor of i. The appearance here of L- and the discrepancy of 
a factor of i will be explained soon. Now, we should be able to describe a finite 
rotation through an angle 0 as an n-fold iteration of smaller rotations through an 
angle 0/n. As we take n larger, then 0 j n becomes smaller and the approximation 
(4.47) for the smaller rotation through e — 9/n becomes better. Thus, we expect 



to become an equality in the limit n -> oo. Flowever, you should check, if it is not 
already a familiar fact, that 


n=\j 

for any real number or matrix X. Thus, we can write 



,x 


(4.48) 


R z (0) = e eL * 


(4.49) 


18 See Hall [8] for further information and references. 
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where from here on out the exponential of a matrix or linear operator is defined by 
the power series in (4.48). Notice that the set {R-(9) = e eLz | 9 e R} is a subgroup 
of SO( 3); this can be seen by explicitly checking that 

R-M)R z (e 2 ) = R z (0i+92), (4.50) 

or using the property 19 of exponentials that e x e Y = e x+Y , or recognizing intuitively 
that any two successive rotations about the z-axis yields another rotation about the 
Z-axis. Notice that (4.50) says that if we consider R z (9) to be the image of 9 under 
a map 


R z :R->SO(3), (4.51) 

then R z is a homomorphism! Any such continuous homomorphism from the addi- 
tive group R to a matrix Lie group G is known as a one-parameter subgroup. One- 
parameter subgroups are actually very familiar to physicists; the set of rotations 
in R 3 about any particular axis (not just the z-axis) is a one-parameter subgroup 
(where the parameter can be taken to be the rotation angle), as is the set of all boosts 
in a particular direction (in which case the parameter can be taken to be the absolute 
value u of the rapidity). We will also see that translations along a particular direction 
in both momentum and position space are one-parameter subgroups as well. 

If we have a matrix X such that e' x e G Vr e M, then the map 


exp : J 


1 1 ->- e 


G 

tx 


(4.52) 


is a one-parameter subgroup, by the above-mentioned property of exponentials. 
Conversely, if we have a one-parameter subgroup y : R — >■ G, then we know that 
y (0) = I (since y is a homomorphism), and, defining 


we have 


x = A (0) , 

dt 


d Y , s y(h+t)-y(t) 

— ( t ) = ltm 

dt /?-> o h 


= lim 

*->■ 0 


y{h)y(t) - y(t) 
h 


y(h) - I 

= 13 ^ 4 — 

= lim LRz: rV> m 
h->0 h r 


= Xy(t). 


(4.53) 


19 This is actually only true when X and Y commute; more on this later. 


4.5 Lie Algebras — Definition, Properties, and Examples 


115 


You should recall 20 that the first order linear matrix differential equation = 
Xy(t) has the unique solution y(t) = e ,x . Thus, every one-parameter subgroup is 
of the form (4.52), so we have a one-to-one correspondence between one-parameter 
subgroups and matrices X such that e ,x e G V/ e R. The matrix X is sometimes 
said to ‘generate’ the corresponding one-parameter subgroup (e.g. L- ‘generates’ 
rotations about the "-axis, according to (4.49)), and to each X there corresponds 
an ‘infinitesimal transformation’ I + eX. Really, though, X is best thought of as a 
derivative (by (4.53)), and (4.47) is best thought of as the first two terms of a Taylor 
expansion, as in 

R z (0)= R z (Q)+e d ^(0) + --- (4.54) 

dd 

(you should verify that (0) = L - !). These matrices corresponding to one- 
parameter subgroups have many useful properties and their study leads to many 
interesting applications, both in mathematics and physics, so we give them a name: 
Lie algebras. 


4.5 Lie Algebras — Definition, Properties, and Examples 

Given a matrix Lie group G C GL(«, C), we define its Lie algebra g to be the set of 
all matrices X e M n (C) such that e tX e G Vr e M. The first goal of this section is to 
establish some of the basic properties of Lie algebras. This will require a somewhat 
formal discussion, however, so we begin with a quick look at the Lie algebras of a 
couple of familiar Lie groups, so that you have examples to keep in mind later. 

First, let us consider GL(n, C) and its Lie algebra g[ («, C). How can we describe 
g[(fz,C)? If X is any element of M„( C). then e' x is invertible for all t, and its 
inverse is merely e~ ,x . Thus 


gl(n,C) = M„(C). 


(4.55) 


Likewise, 


gl(«, M) = M„(R). (4.56) 

Second, consider our good friend 0(3), whose Lie algebra is denoted o(3). How 
can we describe o (3) ? For any X e o(3), we have e ,x e 0(3) Vr e M, which means 
that 

(e' x ) T e ,x = e ,{xT) e ,x = I Vr. (4.57) 

Differentiating this with respect to t yields 

X T e txT e tX + e txT Xe‘ x = 0, (4.58) 


20 See any standard text on linear algebra and linear ODEs. 
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which we can evaluate at t = 0 to give 

X T +X = 0. (4.59) 

Thus, any X e o(3) must be antisymmetric! Will any real, antisymmetric X do? For 
any such X, 

(e tX ) T e tX = e t ^e tX 
= e~ tX e tX 

= I 

so e' x e 0(3) Vf e M, hence X e o ( 3) . Thus, o (3) is the set of all real antisymmetric 
matrices. 

Now, £)((«, C), gl(«, R), and o(3) all share a few nice properties. First off, they 
are all real vector spaces, as you can easily check (g[(n , C) can also be considered 
a complex vector space, but we will not consider it as such in this text). Secondly, 
they are closed under commutators , in the sense that if X and Y are elements of the 
Lie algebra, then so is 


[X, Y] = XY — YX. (4.60) 

This is trivial to verify in the case of g[(n, C), and simple to verify in the case of 

0 (3) . Thirdly, these Lie algebras (and, in fact, all sets of matrices) satisfy the Jacobi 
Identity, 

[[X,Y],Z] + [[Y,Z],X] + [[Z,X],Y] = 0 VX, Y, Z e g. (4.61) 

This can be verified directly by expanding the commutators, and does not depend on 
any special properties of g, only on the definition of the commutator. We will have 
more to say about the Jacobi identity later. 

The reason we have singled out these peculiar-seeming properties is that they 
will turn out to be important, and it turns out that all Lie algebras of matrix Lie 
groups enjoy them: 

Proposition 4.1 Let g be the Lie algebra of a matrix Lie group G. Then g is a real 
vector space, is closed under commutators, and all elements of g obey the Jacobi 
identity. 

Proving this turns out to be somewhat technical, so we will just sketch a proof 
here and refer you to Hall [8] for the details. Let g be the Lie algebra of a matrix 
Lie group G, and let X, Y eg. We’d first like to show that X + Y e g. Since g is 
closed under real scalar multiplication (why?), proving X +Y e g will be enough to 
show that g is a real vector space. The proof of this hinges on the following identity, 
known as the Lie Product Formula, which we state but do not prove: 

e x+Y — lim (gSes)'". (4.62) 

M — >00 V 7 

This formula should be thought of as expressing the addition operation in g in terms 

x r 

01 the product operation in G. With this in hand, we note that A m = (e m e m ) m is a 
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convergent sequence, and that every term in the sequence is in G since it is a product 
of elements in G. Furthermore, the limit matrix A = e x+1 is in GL(n , C) by (4.55). 
By the definition of a matrix Lie group, then, A = e x+Y e G, and thus g is a real 
vector space. 

The second task is to show that g is closed under commutators. First, we claim 
that for any X e g[(n, C) and A e GL(n, (C), 

e AXA ~ 1 =Ae x A-\ (4.63) 


You can easily verify this by expanding the power series on both sides. This implies 
that if X e g and A e G, then AXA ~ 1 e g as well, since 

e ,Axr ‘ = Ae ,x A~ l e G Wt e R. (4.64) 

Now let A — e rY , Leg. Then e tY Xe~ lY e g Vf, and we can compute the derivative 
of this expression at t — 0: 


d 

dt 


e' Y Xe~ ,Y 


= Ye ,Y Xe~ ,Y 


t = o 


r=0 


-e tY Xe~ tY Y I 


f=0 


= YX — XY. 


(4.65) 


Since we also have 


d_ y 

dt 


Xe 


-tY 


t = 0 


lim 

h^O 


e hY Xe~ hY - X 
h 


(4.66) 


and the right side is always in g since g is a vector space, 21 this shows that YX — 
XY = [Y, X] is in g. 

The third and final task would be to verify the Jacobi identity, but as we pointed 
out above this holds for any set of matrices and can be easily verified by direct 
computation. Thus the proposition is established. □ 


Before moving on to a discussion of specific Lie algebras, we should discuss the 
significance of the commutator. We proved above that all Lie algebras are closed 
under commutators, but so what? Why is this property worth singling out? Well, 
it turns out that the algebraic structure of the commutator on g is closely related 
to the algebraic structure of the product on G. This is most clearly manifested in 
the Baker-Campbell-Hausdorff (BCH) formula, which for X and Y sufficiently 
small 22 expresses e x e Y as a single exponential: 

e x e Y =e X+Y+\[X,Y]+-t 1 [X,lX,Y]]--hlY,[X,Y]]+- \ (4 6 g) 


21 To be rigorous, we also need to note that g is closed in the topological sense, but this can be 
regarded as a technicality. 

22 The size of a matrix X e M„ (C) is usually expressed by the Hilbert-Schmidt norm, defined as 

n 

imi= £ l*, 7 | 2 . (4.67) 

U= 1 

If we introduce the basis {E,j} of M„(C) from Example 2.8, then we can identify M„(C) with 

2 

C" , and then (4.67) is just the norm given by the standard Hermitian inner product. 
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It can be shown- that the series in the exponent converges for such X and Y, and 
that the series consists entirely of iterated commutators, so that the exponent really 
is an element of g (if the exponent had a term like XL in it, this would not be 
the case since matrix Lie algebras are in general not closed under ordinary matrix 
multiplication). Thus, the BCH formula plays a role analogous to that of the Lie 
product formula, but in the other direction: while the Lie product formula expresses 
Lie algebra addition in terms of group multiplication, the BCH formula expresses 
group multiplication in terms of the commutator on the Lie algebra. The BCH 
formula thus tells us that much of the group structure of G is encoded in the 
structure of the commutator on g. You will calculate the hist few terms of the 
exponential in the BCH formula in Problem 4.11. 

Now we are ready to look at the Lie algebras of some familiar matrix Lie groups. 

Example 4.23 o (n) and u(n): The Lie algebras of 0(n ) and U (n) 

Recall from the beginning of this section that o(3) is just the set of all real, anti- 
symmetric 3x3 matrices. You should note that the calculations which proved this 
did not depend in any way on n, so we can conclude in general that o(n) is just the 
set of all real, antisymmetric n x n matrices. You can check that the matrices A, ; 
from Example 2.8 form a basis for o(n), so that dimo(n) = . As we will see 

in detail in the n = 3 case, the A jj can be interpreted as generating rotations in the 
i-j plane. 

As for u(n), if we start with defining condition of U ( n ), A ' A ~ 1 = /, and perform 
the same manipulations that led from (4.57) to (4.59), we find that for any X e u(«), 

X T = -X. (4.69) 

In other words, X must be anti-Hermitian. Furthermore, the same argument that 
showed that any antisymmetric matrix is in o(«) also shows that any anti-Hermitian 
matrix is in u (n), so u in) is the set of all n x n anti-Hermitian matrices. In Exer- 
cise 2.5 you constructed a basis for H n (C) , and as we mentioned in Example 2.4, 
multiplying a Hermitian matrix by i yields an anti-Hermitian matrix. Thus, if we 
multiply the basis from Exercise 2.5 by i we get a basis for u(n), and it then follows 
that dimu(n) = n 2 . Note that a real antisymmetric matrix can also be considered an 
anti-Hermitian matrix, so that o(n) C u (n). □ 

Aside Before meeting some other Lie algebras, we have a loose end to tie up. We claimed earlier 
that most physical observables are best thought of as elements of Lie algebras (we will justify this 
statement in Sect. 4.7). However, we know that those observables must be represented by Hermi- 
tian operators (so that their eigenvalues are real), and yet here we see that the elements of o(n) and 
u(n) are anti- Hermitian! This is where our mysterious factor of i, which we mentioned in the last 
section, comes into play. Strictly speaking, o(n) and u (n) are real vector spaces whose elements are 
anti-Hermitian matrices. However, if we permit ourselves to ignore the real vector space structure 
and treat these matrices just as elements of M n ( C), we can multiply them by i to get Hermitian 
matrices which we can then take to represent physical observables. In the physics literature, one 


23 See Hall [8] for a nice discussion and Varadarajan [17] for a complete proof. 
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usually turns this around and defines generators of transformations to be these Hermitian observ- 
ables, and then multiplies by i to get an anti-Hermitian operator which can then be exponentiated 
into an isometry. So one could say that the physicist’s definition of the Lie algebra of a matrix Lie 
group G is g p hysics = {X e GL(n, C) | e ,lX e G Vf e R}. In the rest of this book we will stick with 
our original definition of the Lie algebra of a matrix Lie group, but you should be aware that the 
physicist’s definition is often implicit in the physics literature. 

Example 4.24 o(n — 1, 1): The Lie algebra of 0(n — 1,1) 

Let X e o(n — 1,1), the Lie algebra of the Lorentz group ()(n — 1, 1). What 
do we know about XI By the definition of a Lie algebra, and letting [?/] = 
Diag(l, 1, . . . , 1, — 1), we have 

e' xT [ri]e rX = [n] Vf. 

Differentiating with respect to t and evaluating at t — 0 as before yields 

X T [i 1 ] + WX = 0. (4.70) 

Writing X out in block form with an (n — 1) x (n — 1) matrix X' for the spatial 
components (i.e. the X,j where i, j < n) and vectors a and b for the components 
Xj n and X n i , i < ;? , the above equation reads 



This implies that X has the form 

X = ( X & l'eo(«-l), aef'- 1 . 

One can think of X' as generating rotations in the n — 1 spatial dimensions, and 
a generating a boost along the direction it points in R" _1 . We will discuss this in 
detail in the case n = 4 in the next section. □ 


We have now described the Lie algebras of the isometry groups 0(n), U(n), 
and 0(n — 1,1). What about their cousins SO(n ) and SU(n)l Since these groups 
are defined by the additional condition that they have unit determinant, examining 
their Lie algebras will require that we know how to evaluate the determinant of 
an exponential. This can be accomplished via the following beautiful and useful 
formula: 


Proposition 4.2 For any finite-dimensional matrix X, 


dete x =e TrX . 


(4.71) 


A general proof of this is postponed to Problem 4.14, but we can gain some 
insight into the formula by proving it in the case where X is diagonalizable. Recall 
that diagonalizable means that there exists A e GL(n , C) such that 

AXA~ l = Diag(ki, X 2 , . . . , k„) 
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where Diag(l[ , A. 2 , . . . , A.„) is a diagonal matrix with /,, in the / th row and column, 
and zeros everywhere else. In this case, we have 

dete x = det (Ae x 

= dete AXA 1 (4.72) 

= detDiag(e' Vl , e Xl , . . . , e A ") (see exercise below) (4.73) 
= e M e H ---e K 

— ^A.i+A. 24 h^-n 

= e Tr * 

and so the formula holds. □ 

Exercise 4.22 Prove (4.73). That is, use the definition of the matrix exponential as a power 
series to show that 


e Diag = Diag(e Al , e x \ . . . , e K ). (4.74) 

In addition to being a useful formula, Proposition 4.2 also provides a nice geo- 
metric interpretation of the Tr functional. Consider an arbitrary one-parameter sub- 
group {e ,x } C GL(n,C). We have 

dete rX = e Tl,x = e tTlX , (4.75) 

so taking the derivative with respect to t and evaluating at t = 0 gives 

= TrX. (4.76) 

t = 0 

Since the determinant measures how an operator or matrix changes volumes (cf. 
Example 3.27), this tells us that the trace of the generator X gives the rate at which 
volumes change under the action of the corresponding one-parameter subgroup e ,x . 

Example 4.25 so(n) and su(n), the Lie algebras of SO(n ) and SU(n ) 

Recall that SO(n ) and SU(n ) are the subgroups of O(n) and iJ (n) consisting of 
matrices with unit determinant. What additional condition must we impose on the 
generators X to ensure that dete rX = 1 Vf? From (4.71), it is clear that dete rX = 1 
Vf if and only if 

TrX = 0. (4.77) 

In the case of o(n), this is actually already satisfied, since the antisymmetry con- 
dition implies that all the diagonal entries are zero. Thus, so(n) = o(n), and both 
Lie algebras will henceforth be denoted as so(n). We could have guessed that they 
would be equal; as discussed above, generators X are in one-to-one correspondence 
with 'infinitesimal' transformations I + eX, so the Lie algebra just tells us about 


d 

— dete 
dt 


tx 
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transformations that are close to the identity. However, the discussion from Exam- 
ple 4.12 generalizes easily to show that O(n) is a group with two components, and 
that the component which contains the identity is just SO(n). Thus the set of ‘in- 
finitesimal’ transformations of both groups should be equal, and so should their Lie 
algebras. The same argument applies to SO( 3, 1) 0 and 0(3, 1); their Lie algebras 
are thus identical, and will both be denoted as so(3, 1). 

Lor su(n) the story is a little different. Here, the anti-Hermiticity condition only 
guarantees that the trace of an anti-Hermitian matrix is pure imaginary, so demand- 
ing that it actually be zero is a bona fide constraint. Thus, su(n) can without re- 
dundancy be described as the set of traceless, anti-Hermitian n x n matrices. The 
tracelessness condition provides one additional constraint beyond anti-Hermiticity, 
so that 

dimsu(n) = dimu(n) — 1 = n 2 — 1. (4.78) 

Can you find a nice basis for su(n)? 

Exercise 4.23 Show directly that so (n) and su(n) are vector spaces. Then prove the cyclic 
property of the Trace functional, 


Tr(A 1 A 2 ---A„) = Tr(A 2 ---A„Ai), A,sM„( C) (4.79) 

and use this to show directly that so (n ) and su(n) are closed under commutators. 


4.6 The Lie Algebras of Classical and Quantum Physics 

We are now ready for a detailed look at the Lie algebras of the matrix Lie groups 
we discussed in Sect. 4.2. 


Example 4.26 so (2) 


As discussed in Example 4.23, so(2) consists of all antisymmetric 2x2 matrices. 
All such matrices are of the form 



and so so(2) is one-dimensional and we may take as a basis 


(4.80) 


X = 


You will explicitly compute that 


„ex . 


cost) — sin(9 
sin (9 cost) 


6e] 


(4.81) 


(4.82) 


so that X really does generate counterclockwise rotations in the x-y plane. 


Exercise 4.24 Verify (4.82) by explicitly summing the power series for e ex . Also, consider 
the vector field X -* induced by X on R 2 , defined as follows: for every r e R 2 , 
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X t {r) = Xr (4.83) 

where X acts on r by matrix multiplication. Sketch this vector field and show that the flow 
lines are circles. This gives direct geometric meaning to the matrix X , and is another sense 
in which ‘X generates rotations’. 


Example 4.27 so (3) 


As remarked in Example 4.23, the matrices A (/ form a basis for o(n). Specializing 
to n — 3 and renaming the Ajj as 

/0 0 0 \ / 0 0 l\ /0 -1 0\ 

L x = 0 0 -1 , L y = 0 0 0, L z = 1 0 0 , 

\0 1 0 / \-l 0 0/ \0 0 0/ 

(4.84) 

we can write an arbitrary element X e so (3) as 
/ 0 -z y \ 

X= z 0 — x I = xL x + yL y + zL z (4.85) 

\-y x o / 


where you should recognize L z from the previous section. Note that dimso(3) = 3, 
which is also the number of parameters in SO(3). You can check that the commuta- 
tors of the basis elements work out to be 


I Ay , Ly] — L z 
[Ly, L z \ = L x 
[L Z , L x ] = Ly 

or, if we label the generators with numbers instead of letters, 

3 

[Li, Lj] = T, ejjkLk. 
k=l 

(We will frequently change notation in this way when it proves convenient; hope- 
fully this causes no confusion.) These are, of course, the well-known angular mo- 
mentum commutation relations of quantum mechanics. The relation between the 
Lie algebra of so(3) and the usual angular momentum operators (2.1 1) will be ex- 
plained in the next chapter. Note that if we defined Hermitian so (3) generators by 
L' = i L j . the commutation relations would become 

l 1 7 

[ L\ , L'j] = ieijkL' k , 
k= 1 

which is the form found in most quantum mechanics texts. 

We can think of the generator X in (4.85) as generating a rotation about the axis 
[•^](L,-} = (x, y, z), as follows: for any v = ( v x , v y , v z ) e K 3 , you can check that 
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/ 0 -z y W v x \ 

Xv = z 0 -x I I v y I = (x, y, z) x (v x , v y , v z ) = [I]xt 

\—y x 0 J \ v zJ 

so that if v lies along the axis [X], then Xv — 0. This then implies that 


e tX [X] = [X] 

(why?), so e' x must be a rotation about [X] (since it leaves [X] fixed). In fact, if we 
take [X] to be a unit vector and rename it n, and also rename t as 9, you will show 
below that 

( n^(l — cos 9) + cos 9 n x n y ( \ — cost?) — n z sin# n x n z ( 1 — cos 9) + n y sin0 
n v n. v (l — cos0) + sin0 n~(l — cost)) + cos$ n y n z (l — cosd) — n x sin9 
n z n x (l — cos#) — n y sin# n z n y (\ — cos9) + n x sind n?(l — cos0) + cost! 

(4.86) 

proving our claim from Example 4.11 that this matrix represents a rotation about n 
by an angle 0 . 

The astute reader may have noticed that we already made a connection between 
antisymmetric 3x3 matrices and rotations back in Example 3.30. There we saw 
that if A(t ) was a time-dependent orthogonal matrix representing the rotation of a 
rigid body, then the associated angular velocity bivector (in the space frame) was 
[a>] = If we let A(t) be a one-parameter subgroup A(t) = e ,x generated by 

some X e so (3), then the associated angular velocity bivector is 



r ~, dA(t) x 
[a>] = — — A 
dt 

= Xe ,x e~ ,x 


= X. 


(4.87) 


Thus the angular velocity bivector is just the generator of the rotation! Further- 
more, applying the J map from Example 3.29 to both sides of (4.87) gives 


[co] = [X] 

and so the pseudovector a> is just the rotation generator expressed in coordinates [X]. 
Note that this agrees with our discussion above, where we found that [X] gave the 
axis of rotation. 


Exercise 4.25 Let X e so(3) be given by 

( 0 -n z n y \ 

n z 0 -n x (4.88) 

-n y n x 0 / 

where n = (n x , n y , n z ) is a unit vector. Verify (4.86) by explicitly summing the power series 
for e ex . 

Exercise 4.26 Using the basis B = {L x , L y , L z } for so(3), show that 


UX T]] e = Me x [F] b , 


(4.89) 
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which shows that in components, the commutator is given by the usual cross product on R 3 . 
(Note that on the left hand side of the above equation the two sets of brackets have entirely 
different meanings; one signifies the Lie bracket, while the other denotes the component 
representation of an element of the Lie algebra.) 


Example 4.28 su(2) 

We already met su(2), the set of 2 x 2 traceless anti-Hermitian matrices, in Exam- 
ple 4.19. We took as a basis 


1 

5 ^2 


0 —i 
-i 0 


1 

Sy = ~ 


-1 

0 


1 


-i 0 
0 i 


(4.90) 


Notice again that the number of parameters in the matrix Lie group SU(2 ) is equal to 
the dimension of its Lie algebra su(2). You can check that the commutation relation 
for these basis vectors is 


[Si, Sj] = Y, e ,jkS k 


(4.91) 


k= 1 


which is the same as the 50 (3) commutation relations! Does that mean that su(2) 
and so (3) are, in some sense, the same? And is this in some way related to the ho- 
momorphism from SU (2) to SO( 3) that we discussed in Example 4.19? The answer 
to both these questions is yes, as we will discuss in the next few sections. We will 
also see that, just as X = xL x + yL y + zL z e 50 (3) can be interpreted as generating 
a rotation about ( x , y , z) e R 3 , so can Y = xSj + yS 2 + zSj € su(2). 

Exercise 4.27 Let n = (n l , n 2 . « 3 ) be a unit vector. Show by direct calculation that 

^cos(0/2) — in z sin(0/2) (— in x — n y ) sin(0/2) 

(—in x + n y ) sin(0/2) cos(0/2) +in z sin(0/2) 

= cos (8 /2)I + 2 sin(0 /2) n' 5; . (4.92) 

You will use this formula in the problems to show that e a " s ‘ represents a rotation about the 
axis n by an angle 9 . 


Example 4.29 so(3, 1) 

From Example 4.24, we know that an arbitrary X e so(3, 1) can be written as 

(r a 
x -{ a 0 

with X' e so(3) and a e R 3 . Embedding the L, of so(3) into so(3, 1) as 

and defining new generators 
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/ 0 

0 

0 

1 \ 


(0 

0 

0 

°\ 


(0 

0 

0 

°\ 

0 

0 

0 

0 

K 2 = 

0 

0 

0 

1 

K 3 = 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

V 1 

0 

0 

V 



1 

0 

0 J 



0 

1 

0/ 


we have the following commutation relations (Exercise 4.28): 


3 

[Lj, L j] = "y ' (jjjkLk 
k=\ 

3 

[Li, K J ] = Y j e ljk K k (4.93) 

k= 1 

3 

[Ki, Kj ] = — y ^ejjkLk- 
k= 1 


As we mentioned in Example 4.24, the K , can be interpreted as generating boosts 
along their corresponding axes. In fact, you will show in Exercise 4.28 that 


e 


u‘Kj 


/ u 2 (cosh w — 1) . , 
2 1" 1 

UyU x (coshu— 1) 
u 2 

U Z U X (cosh M — 1) 

u 2 

V — sinh u 

\ u 


u x Uy (cosh M — 1 ) 


M^(cOShM — 1) 

5 i A 

u z 

u z u y (cosh M — 1) 


J sinh m 


w x w z (coshw— 1 
i/2 


UyU z (cosh M — 1) 

»y2 


(cosh w — 1) 
u 2 

— sinh u 


+ 1 


— sinh u \ 

U 

sinh u 

— sinh u 

U 

cosh u ) 


(4.94) 


which as we know from Example 4.14 represents a boost in the direction of u. Also, 
note that the last commutation relation in (4.93) says that if we perform two suc- 
cessive boosts in different directions, the order matters. Furthermore, the difference 
between performing them in one order and the other is actually a rotation! 


Exercise 4.28 Verify the commutation relations (4.93), and verify (4.94) by explicitly 
summing the exponential power series. 


Example 4.30 sl(2, C)« 

sl(2, Or is defined to be the Lie algebra of SL( 2, C), viewed as a real vector space. 
Since SL( 2, C) is just the set of all 2 x 2 complex matrices with unit determinant, 
sl(2. Or is just the set of all traceless 2x2 complex matrices, and thus could be 
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viewed as a complex vector space, though we will not take that point of view here. 24 
A (real) basis for sl(2, C)r is 


51 

5 2 

5 3 

Ki 

K 2 

k 3 


1 

2 

1 

2 

1 

2 

1 

2 

1 

2 

1 

2 



Note that Kj — iSj. This fact simplifies certain computations, such as the ones you 
will perform in checking that these generators satisfy the following commutation 
relations: 


3 

[S,-, Sj] — Y^jkSk 

k= 1 
3 

[Si, Kj] — Y*ijkKk 
k= 1 

3 

[Ki,Kj] = -Y^jkSk. 

k= 1 

These are identical to the so (3, 1) commutation relations! As in the case of su(2) and 
so(3), this is intimately related to the homomorphism from SL( 2, C) to SO( 3, 1) G 
that we described in Example 4.20. This will be discussed in Sect. 4.8. 


Exercise 4.29 Check that the Kj generate boosts, as you would expect, by explicitly calcu- 
lating e“‘N to get (4.35). 


24 The space of all traceless 2x2 complex matrices viewed as a complex vector space is denoted 
al(2, C) without the R subscript. In this case, the Sj suffice to form a basis. In this text, however, 
we will usually take Lie algebras to be real vector spaces, even if they naturally form complex 
vector spaces as well. You should be aware, though, that s 1(2, C) is fundamental in the theory of 
complex Lie algebras, and so in the math literature the Lie algebra of SL(2, C) is almost always 
considered to be the complex vector space a 1(2, C) rather than the real Lie algebra a 1(2, Or. We 
will have more to say about a 1(2, C) in the Appendix. 
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4.7 Abstract Lie Algebras 

So far we have considered Lie algebras associated to matrix Lie groups, and we 
sketched proofs that these sets are real vector spaces which are closed under com- 
mutators. As in the case of abstract vector spaces and groups, however, we can now 
turn around and use these properties to define abstract Lie algebras. This will clarify 
the nature of the Lie algebras we have already met, as well as permit discussion of 
other examples relevant for physics. 

That said, a (real, abstract) Lie algebra is defined to be a real vector space g 
equipped with a bilinear map [•,■]: g x g — > g called the Lie bracket which satisfies 

1. [X, Y] = -[T, X] VX, Y e g (Antisymmetry) 

2. [[X, F], Z] + [[F, Z], X] + [[Z, X],F] = OVX,F,Zeg (Jacobi identity) 

By construction, all Lie algebras of matrix Lie groups satisfy this definition 
(when we take the bracket to be the commutator), and we will see that it is pre- 
cisely the above properties of the commutator that make those Lie algebras useful 
in applications. Furthermore, there are some (abstract) Lie algebras that arise in 
physics for which the bracket is not a commutator, and which are not usually asso- 
ciated with a matrix Lie group; this definition allows us to include those algebras 
in our discussion. We will meet a few of these algebras below, but first we consider 
two basic examples. 

Example 4.31 gl(V): The Lie algebra of linear operators on a vector space 

Let V be a (possibly infinite-dimensional) vector space. We can turn C(V), the set 
of all linear operators on V , into a Lie algebra by taking the Lie bracket to be the 
commutator, i.e. 


[T, U] = TU — UT, T,U e C(V). (4.95) 

Note that this is a commutator of operators, not matrices, though of course there 
is a nice correspondence between the two when V is finite-dimensional and we 
introduce a basis. This Lie bracket is obviously antisymmetric and can be seen to 
obey the Jacobi identity, so it turns C(V) into a Lie algebra which we will denote 
by gl(y). 25 We will have more to say about gl(F) as we progress. 

Example 4.32 isom(V): The Lie algebra of anti-Hermitian operators 

Consider the setup of the previous example, except now let V be an inner product 
space. For any T e C(V), the inner product on V allows us (via (4.9)) to define 
its adjoint T^, and we can then define isom(F) C gKL) to be the set of all anti- 
Hermitian operators, i.e. those which satisfy 


25 There is a subtlety here: the vector space underlying 0[(L) is of course just C(V), so the differ- 
ence between the two is just that one comes equipped with a Lie bracket, and the other is considered 
as a vector space with no additional structure. 
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(4.96) 


You can easily verify that isom( V) is a Lie subalgebra of g[(L). (A Lie subalgebra 
of a Lie algebra g is a vector subspace t) C g that is also closed under the Lie bracket, 
and hence forms a Lie algebra itself. ) If V is n -dimensional and we introduce an 
orthonormal basis, we see that the matrix form of isom(L) is o(n) if V is real and 
u(n) if V is complex. 


Example 4.33 The Poisson bracket on phase space 


Consider a physical system with a 2n -dimensional phase space P parametrized by n 
generalized coordinates q , and the n conjugate momenta /?, . The set of all complex- 
valued, infinitely differentiable 26 functions on P is a real vector space which we 
will denote by C(P). We can turn CAP) into a Lie algebra using the Poisson bracket 
as our Lie bracket, where the Poisson bracket is defined by 


{f,g} = J2 


3/ 3g 


3 qi dpi 


dg_df_ 
3 qt dp, ' 


(4.97) 


The antisymmetry of the Poisson bracket is clear, and the Jacobi identity can be 
verified directly by a brute-force calculation. 

The functions in C(P ) are known as observables , and the Poisson bracket thus 
turns the set of observables into one huge 27 Lie algebra. The standard quantization 
prescription, as developed by Dirac, Heisenberg, and the other founders of quantum 
mechanics, is to then interpret this Lie algebra of observables as a Lie subalgebra 
of gl(7f) for some Hilbert space Ti (this identification is known as a Lie algebra 
representation, which is the subject of the next chapter). The commutator of the 
observables in g((7f ) is then just given by the Poisson bracket of the corresponding 
functions in C(P). Thus the set of all observables in quantum mechanics forms a 
Lie algebra, which is one of our main reasons for studying Lie algebras here. 

Though C(P) is in general infinite-dimensional, it often has interesting finite- 
dimensional Lie subalgebras. For instance, if P — R 6 and the q, are just the usual 
cartesian coordinates for R 3 , we can consider the three components of the angular 
momentum. 


J l = A2P3 ~qiP2 

h = q-hP\- q\p-h (4.98) 

h — q\pi- qipi. 


26 A function is “infinitely differentiable” if it can be differentiated an arbitrary number of times. 
Besides the step function and its derivative, the Dirac delta ‘function’, most functions that one 
meets in classical physics and quantum mechanics are infinitely differentiable. This includes the 
exponential and trigonometric functions, as well as any other function that permits a power series 
expansion. 

27 By this we mean infinite-dimensional, and usually requiring a basis that cannot be indexed by 
the integers but rather must be labeled by elements of R or some other continuous set. You should 
recall from Sect. 3.7 that L 2 (R) was another such ‘huge’ vector space. 
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all of which are in C(R 6 ). You will check below that the Poisson bracket of these 
functions turns out to be 

3 

{Ji,jj}=Y, € w J k' ( 4 - 99) 

k= 1 

which are of course the familiar angular momentum (so(3)) commutation relations 
(as mentioned above, this is in fact where the angular momentum commutation re- 
lations come from!). This then implies that so(3) is a Lie subalgebra of C(M 6 ). 
You may be wondering, however, why the angular momentum commutation rela- 
tions are the same as the 50 (3) commutation relations. What do the functions J \ 
have to do with generators of rotations? The answer has to do with a general re- 
lationship between symmetries and conserved quantities, which goes (roughly) as 
follows: Consider a classical system (i.e. a phase space P together with a Hamil- 
tonian H eC(P )) which has a matrix Lie group G of canonical transformations 28 
acting on it. If H is invariant 29 under the action of G, then G is said to be a group 
of symmetries of the system. In this case, one can then show 30 that for every X e g 
there is a function fx € C( P) which is constant along particle trajectories in P 
(where trajectories, of course, are given by solutions to Hamilton's equations). This 
fact is known as Noether’s theorem , and it tells us that every element X of the Lie 
algebra gives a conserved quantity fx- Furthermore, the Poisson bracket between 
two such functions fx and f\ is given just by the function associated to the Lie 
bracket of the corresponding elements of g, i.e. 

[fx,fY} = fix,Y]. (4.100) 

If G — 50(3) acting on P by rotations, then it turns out ’ 1 that 

fx(qi,Pi) = {Kqi,Pi),[X\), Ye so(3) (4.101) 

where J = (J \ , Ji, J 3 ) is the usual angular momentum vector with components 
given by (4.98) and (• | •) is the usual inner product on R 3 . In particular, for 
X — L, e so(3) we have 


fu = (J, [L,]) = (J, e,) = f. (4.102) 

Thus, the conserved quantity associated to rotations about the ith axis is just the 
/th component of angular momentum! This is the connection between angular mo- 
mentum and rotations, and from (4.100) we see that the 7, must have the same 
commutation relations as the L , , which is of course what we found in (4.99). □ 


28 I.e. one-to-one and onto transformations which preserve the form of Hamilton’s equations. See 
Goldstein [6]. 

29 Let (fig : P — > P be the transformation of P corresponding to the group element g e G. Then H 
is invariant under G if H(cp g (p)) = H (p) V p e P , g e G . 

30 Under some mild assumptions. See Cannas [3] or Arnold 1 1], for example. 

31 See Arnold [1] for a discussion of Noether’s theorem and a derivation of an equivalent formula. 
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Exercise 4.30 Verify (4.99). Also show that for a function F e C(R 6 ) that depends only 
on the coordinates q , , 

{p j ,F(q l )\ = -l F - (4.103) 

dqj 

and 

, .3 F dF d F 

{7 3 ,F(^,)} = -?la I-92 t— =-T7 (4.104) 

dq 2 3q\ d <p 

where cp is the azimuthal angle. We will interpret these results in the next chapter. 

If we have a one-dimensional system with position coordinate q and conjugate 
momentum p, then !’ — R 2 and C( P) contains another well-known Lie algebra: the 
Heisenberg algebra. 

Example 4.34 The Heisenberg algebra 

Define the Heisenberg algebra H to be the span of { q , p , 1} c C(R 2 ), where 1 e // 
is just the constant function with value 1 . The only nontrivial Poisson bracket is 
between p and q, which you can check is just 


{q,p} = h (4.105) 

which apart from the factors of h (which we have dropped throughout the text) and i 
(which is just an artifact of the physicist’s definition of a Lie algebra) is the familiar 
commutation relation from quantum mechanics. H is clearly closed under the Lie 
(Poisson) bracket, and is thus a Lie subalgebra of C(R 2 ). 

Can p and q be thought of as generators of specific transformations? Well, one of 
the most basic representations of p and q as operators is on the vector space L 2 (R), 
where 


qf(x) = xf(x) (4.106) 

Pf(x) = - d f (4.107) 

ax 

(again we drop the factor of i , in disagreement with the physicist’s convention). Note 
that [q, p\ = {q, p} — 1. If we exponentiate p, we find (see Exercise 4.31 below) 

e , Pf(.x) = f(x-t ) (4.108) 

so that p = — Jy generates translations along the x-axis! It follows that {e'P | t e R} 
is the one-parameter subgroup of all translations along the x-axis. What about ql 
Well, if we work in the momentum representation of Example 3.17 so that we are 
dealing with the Fourier transform (pip) of fix), you know from Exercise 3.20 that 
q is represented by / so if we multiply by i we get iq = — ^ and thus 

e i, «cp{p) = <j ) (p-t), (4.109) 

which you can think of as a ‘translation in momentum space’ . We will explain the 
extra factor of i in the next chapter. 
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If we treat H as a complex vector space then we can consider another common 
basis for it, which is [Q, P, 1} C C(R 2 ) where 


Q = 


p = 


p + iq 

p-jq 

V2i 


You can check that 


{Q,P}=1 (4.110) 

which you may recognize from classical mechanics as the condition that Q and P 
be canonical variables (see Goldstein [6]). Q and P are well suited to the solution 
of the one-dimensional harmonic oscillator problem, and you may recognize their 
formal similarity to the raising and lowering operators a and o' employed in the 
quantum-mechanical version of the same problem. 

Q and P are not easily interpreted as generators of specific transformations, and 
our discussion of them helps explain why we defined abstract Lie algebras — so that 
we could work with spaces that behave like the Lie algebras of matrix Lie groups (in 
that they are vector spaces with a Lie bracket), but are not necessarily Lie algebras 
of matrix Lie groups themselves. 

Exercise 4.31 Show by exponentiating p = — ^ that e'P f(x) is just the power series 

expansion for fix — t). 

Exercise 4.32 Verify (4. 1 10). 


4.8 Homomorphism and Isomorphism Revisited 

In Sect. 4.3 we used the notion of a group homomorphism to make precise the rela- 
tionship between SU( 2) and SO( 3), as well as SL( 2, C) and SO(3, l) a . Now we will 
define the corresponding notion for Lie algebras to make precise the relationship 
between su(2) and so(3), as well as sl(2, C)® and so(3, 1), and we will show how 
these relationships between Lie algebras arise as a consequence of the relationships 
between the corresponding groups. 

That said, we define a Lie algebra homomorphism from a Lie algebra g to a Lie 
algebra f) to be a linear map 0 : g — ^ f) that preserves the Lie bracket, in the sense 
that 


>(X),0(y)] = 0([X,r]) vxjeg. 


(4.111) 


If 0 is a vector space isomorphism (which implies that g and f) have the same di- 
mension), then (p is said to be a Lie algebra isomorphism. In this case, there is a 
one-to-one correspondence between g and f) that preserves the bracket, so just as 
with group isomorphisms we consider g and f) to be equivalent and write g — f). 
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Sometimes the easiest way to prove that two Lie algebras g and 1) are isomorphic 

is with an astute choice of bases. Let {A',- and {T/};=i „ be bases for g 

and t), respectively. Then the commutation relations take the form 

n 

[X i ,X j ] = J2 c ij k Xk 

k= 1 
n 

[Yi, Yj] = J2 d u k n 

k= 1 

where the numbers c/j k and dij k are known as the structure constants of g and f). 
(This is a bit of a misnomer, though, since the structure constants depend on a choice 
of basis, and are not as inherent a feature of the algebra as they make themselves out 
to be.) If one can exhibit bases such that Cj k — d,j k V/, j, k, then it is easy to check 
that the map 

</>:g-> f) 

v i X i h * v‘Yi 

is a Lie algebra isomorphism. We will use this below to show that so(3) — su(2) 
and so(3, l)~st(2,C) R . 


Example 4.35 g[(V) and g[(n, C) 

Let V be an n -dimensional vector space over a set of scalars C. If we choose a basis 
for V , we can define a map from the Lie algebra gl(V) of linear operators on V to 
the matrix Lie algebra gl(n, C) by ! [7’]. You can easily check that this is a Lie 

algebra isomorphism, so g((L) ~ g[(n, C). If V is an inner product space we can 
restrict this isomorphism to isom(L) C gl(L) to get 

isom(L) ~ o(n) if C — K 
isom(L) ~ u(n) if C — C. 


Example 4.36 The ad homomorphism 

Let g be a Lie algebra. We can use the bracket to turn Xeg into a linear operator 
adx on g, by defining 


ad z (L) = [Y,y], X,T€g. 

We thus have a map 


(4.112) 


ad : g — ► gl(g) 
X adx 


between two Lie algebras. Is this map a Lie algebra homomorphism? It is if 
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ad[x.y] = [adx. ady] (4.113) 

(notice that the bracket on the left is taken in g and on the right in g[(g)). You will 
verify this in Exercise 4.33 below, where you will find that (4.113) is equivalent to 
the Jacobi identity! This is one way of interpreting the Jacobi identity: it guarantees 
that ad is a Lie algebra homomorphism for any Lie algebra g. and we will see that 
the ad homomorphism occurs frequently in physics. In fact, we already met it in 
Example 2.14! Besides its appearance in physics, the ad homomorphism is also 
fundamental in the beautiful structure theory of Lie algebras; this is treated in detail 
in Hall [8], □ 

Exercise 4.33 Verify that (4. 1 13) is equivalent to the Jacobi identity. 

Before we get to more physical examples, we need to explain how a continuous 
homomorphism from a matrix Lie group G to a matrix Lie group H leads to a 
Lie algebra homomorphism from g to f). This is accomplished by the following 
proposition: 


Proposition 4.3 Let <J> : G —*■ H be a continuous homomorphism from a matrix Lie 
group G to a matrix Lie group H . Then this induces a Lie algebra homomorphism 
<p : g — y f) given by 


</>(*) = 


(4.114) 


Proof This proof is a little long, but is a nice application of all that we have been dis- 
cussing. Let <t> : G — »■ H be such a homomorphism, and let \e rX } be a one -parameter 
subgroup in G. It’s easy to see (check!) that {<t>(e tX )} is a one-parameter subgroup 
of H, and so 


d 

dt 


<$>(e ,x ) 


= Z 
t = o 


(4.115) 


is an element of t) (by the definition of the Lie algebra of a matrix Lie group). We 
can thus define a map < p from g to f) by (p(X) = Z, and it follows that 


<I>(e ,z ) = e ^ (X) . (4.116) 

We can roughly think of <p as the ‘infinitesimal’ version of <t>, with <p taking ‘in- 
finitesimal’ transformations in g to ‘infinitesimal’ transformations in f). Is (p a 
Lie algebra homomorphism? There are several things to check. First, we must 
check that (p is linear, which we can prove by checking that (p(sX) = stp(X) and 
<p(X + Y) — (p(X) + (p( Y). To check that cp(sX) = scp(X), we can just differentiate 
using the chain rule: 


f(sX) = 



d(st) d 


dt d(st) 


t = 0 

cp( e rsX ) 


t=st = 0 
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Checking that ip(X + Y) = (j>(X) + (j)(Y) is a little more involved and involves a bit 
of calculation. The idea behind this calculation is to use the Lie product formula to 
express the addition in g in terms of the group product in G, then use the fact that 
<f> is a homomorphism to express this in terms of the product in H, and then use the 
Lie product formula in reverse to express this in terms of addition in (]. We have 


c/,(X + Y)= ^®{e ,(X+Y) ) 


t = o 


( lim (e* e‘™) m ) 

\m— >o o' ’ / 


t=0 


d 

— O 
dt 


d ,, tx 

— lim O lie m e m ) ) 

dt m -*- oo vv ' ' 

— lim 

(It m—Yoo ' v ' ' 


1=0 


d / ^0(X) ) . ffi 

— lim le "• e m ) 
dt m-to o v ' 


by Lie Product formula 
since <J> is continuous 

since <J> is a homomorphism 

r=0 

by (4.116) 


_ “tmx)+<KY)) 

dt 

= <KX) + <ky). 


1=0 

by Lie product formula 


t = o 


So we have established that <p is a linear map from g to fi. Does it preserve the 
bracket? Yes, but proving this also requires a bit of calculation. The idea behind this 
calculation is just to express everything in terms of one-parameter subgroups and 
then use the fact that <t> is a homomorphism. You can skip this calculation on a first 
reading, if desired. We have 


[<P(X), </>(T)] 


— e , ^ x) <p(Y)e- , ‘ l ’ (X) by (4.65) 
dt 

^<P(e ,x )4>(Y)<t>(e- ,x ) by (4.116) 

-®(e tx )( -e s ^)^(e- lX ) 
dt v ’ \ds J v ' 

0(e tX )0(e sY )0(e~ tX ) by (4.116) 

d d j *v [V * v \ 

0(e e e ) since <t> is a homomorphism 

dt ds y ’ 

^L^LcS,Use ,X Ye-‘ x )\ by ( 4. 63) 

dt ds y ’ y 

—(p[e ,x Ye~ ,x ) by definition of </> 
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= <p^^e tX Ye ,x ^j by Exercise 4.34 below 
= </>([*, n) by (4.65) 

and so 0 is a Lie algebra homomorphism, and the proof is complete. □ 

Exercise 4.34 Let <p be a linear map from a finite-dimensional vector space V to a finite- 
dimensional vector space W, and let y(t) : R — > V be a differentiable V - valued function 
of t (you can think of this as a path in V parametrized by t). Show from the definition of a 
derivative that 

^(y(0) = «(^y(0 J vt . (4.117) 

Now let us make all this concrete by considering some examples. 

Example 4.37 5(7(2) and 50(3) revisited 


Recall from Example 4.19 that we have a homomorphism p : 5(7(2) — > 50(3) de- 
fined by the equation 


[AXA\ = p(A)[X]t3 (4.118) 

where X e su(2), A e 5(7(2) and B — { S x . S y , S z }. The induced Lie algebra homo- 
morphism 0 is given by 


cP(Y)=^p(e tY ) 

and you will show below that this gives 


(4.119) 


4>(S i ) = L i , (=1-3. 


(4.120) 


This means that 0 is one-to-one and onto, and since the commutation relations (i.e. 
structure constants) are the same for the 5, and /., , we can then conclude from our 
earlier discussion that 0 is a Lie algebra isomorphism, and thus su(2) ~ so (3). You 
may have already known or guessed this, but the important thing to keep in mind 
is that this Lie algebra isomorphism actually stems from the group homomorphism 
between 5(7(2) and 50(3). 


Exercise 4.35 Calculate jj p(e ,s ')\i=o and verify (4.120). 


Example 4.38 5L( 2, C) and 50(3, 1) 0 revisited 


Just as in the last example, we will now examine the Lie algebra isomorphism be- 
tween sl(2, C)k and so(3, 1) that arises from the homomorphism p : SL(2,C ) — > 
50(3, 1) 0 . Recall that p was defined by 

[AXA% = p(A)[X] B (4.121) 

where A e SL( 2, C), X e /72(C) and B = {cr v , a y , a z , /}. The induced Lie algebra 
homomorphism is given again by 
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HY)= J t P(e ,Y ) 

r=0 

(4.122) 

which, as you will again show, yields 



4>{Si) = Lj 


(4.123) 

HKi) = Ki. 


(4.124) 


Thus (p is one-to-one, onto, and preserves the bracket (since the Li and K, have the 
same structure constants as the S\ and Kf), so sl(2, C)r ~ so(3. 1). □ 


Exercise 4.36 Verify (4.123) and (4.124). 

What is the moral of the story from the previous two examples? How should one 
think about these groups and their relationships? Well, the homomorphisms p allows 
us to interpret any A e SU(2) as a rotation and any A e SL(2. C) as a restricted 
Lorentz transformation. As we mentioned before, though, p is two-to-one, and in 
fact A and —A in SU(2) correspond to the same rotation in SO( 3), and likewise 
for SL( 2, C). However, A and —A are not ‘close’ to each other; for instance, if we 
consider an infinitesimal transformation A = I + eX, we have —A = —I — eX, 
which is not close to the identity (though it is close to —I). Thus, the fact that p is 
not one-to-one cannot be discerned by examining the neighborhood around a given 
matrix; one has to look at the global structure of the group for that. So one might 
say that locally, SU( 2) and SO( 3) are identical, but globally they differ. In particular, 
they are identical when one looks at elements near the identity, which is why their 
Lie algebras are isomorphic. The same comments hold for SL{ 2, C) and SO( 3, 1)„. 

One important fact to take away from this is that the correspondence between 
matrix Lie groups and Lie algebras is not one-to-one; two different matrix Lie 
groups might have the same Lie algebra. Thus, if we start with a Lie algebra, there 
is no way to associate to it a unique matrix Lie group. This fact will have important 
implications in the next chapter. 

Example 4.39 The Ad and ad homomorphisms 


You may have found it curious that su(2) was involved in the group homomorphism 
between SU( 2) and SO (3). This is no accident, and Example 4.37 is actually an 
instance of a much more general construction which we now describe. Consider a 
matrix Lie group G and its Lie algebra g. We know that for any A e G and X e g, 
AX A -1 is also in g, so we can actually define a linear operator Ad,\ on g by 


Ad A (A) = AXA~ l , A eg. 


(4.125) 


We can think of Ad A as the linear operator which takes a matrix X and applies the 
similarity transformation corresponding to A, as if A was implementing a change 
of basis. This actually allows us to define a homomorphism 


Ad : G -* GL(g) 
A i — > Ad A , 
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where you should quickly verify that Ad^Ad# = Ad,!/;. Since Ad is a homomor- 
phism between the two matrix Lie groups G and GL(g ), we can consider the induced 
Lie algebra homomorphism (p : g -> gl(g). What does </; look like? Well, if X e g 
then <f)(X) e g 1(g) is the linear operator given by 

(4.126) 

1=0 

To figure out what this is, exactly, we evaluate the right hand side on Y e g: 


0(X)= — Ad e ,x 
at 


HX){Y) = j t (M e ,x(Y)] 


t = o 


= ±( e tx Ye -,X} 

dt v 

= IX, Y] 


so (f> is nothing but the ad homomorphism of Example 4.36! Thus ad is the ‘in- 
finitesimal’ version of Ad, and the Lie bracket is the infinitesimal version of the 
similarity transformation. 

Note also that since Ad is a homomorphism. Ad e tx is a one-parameter subgroup 
in GL{ g), and its derivative at 1 — 0 is ad,y. From the discussion at the end of 
Sect. 4.4 we can then conclude that 


Ad e tx — e ' adjf (4.127) 

as an equality between operators on g. In other words, applying (4.127) to Y e g, 

1 2 f 3 

e tX Ye~ ,x = Y + t\X, Y] + -[X, [A, Y]] + -[X, [X, [X, Y]]] + ■■■. (4.128) 

It is a nice exercise to expand the left hand side of this equation as a power series 
and verify the equality; this is Problem 4.13. □ 


Exercise 4.37 Let X, H e M n (C). Use (4.127) to show that 

[X, H] = 0 4=> e‘ x He~ ,x = H VfeR. (4.129) 

If we think of H as a quantum-mechanical Hamiltonian, this shows how the invariance 
properties of the Hamiltonian (like invariance under rotations R ) can be formulated in terms 
of commutators with the corresponding generators. 

To make the connection between all this and Example 4.37, we note that usually 
Ad a will preserve a metric on g (known as the Killing Form K ; see Problem 4. 12), 
and thus 


Ad : G Isom(g). (4.130) 

In the case of G — SU(2) above, g = su(2) is three-dimensional and K is positive- 
definite, so 3 - 


32 0 f course, this identification depends on a choice of basis, which was made when we chose to 
work with B = {Si, S 2 , S 3 }. 
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Isom(su(2)) ~ 0(3), 


(4.131) 


and thus Ad : 50(2) — >■ 0(3). You can check that this map is identical to the ho- 
momorphism p described in Example 4.19, and so we actually have Ad : 50( 2) -> 
50(3)! Thus the homomorphism between 50(2) and 50(3) is nothing but the Ad- 
joint map of 50(2), where Ad(g), g e 50(2) is orthogonal with respect to the 
Killing form on su(2). We also have the corresponding Lie algebra homomorphism 
ad : su(2) -» so(3), and we know that this must be equal to </; from Example 4.37, 
so [adsjg = (p(Sj) — Lj. 

Exercise 4.38 Using [5, , Sj] = € ijkSk> compute the matrix representation of ad^. in 

the basis B and verify explicitly that [adsjg = L, . 


4.9 Problems 

4.1 In this problem we show that SO{n) can be characterized as the set of all linear 
operators which take orthonormal (ON) bases into orthonormal bases and can be 
obtained continuously from the identity. 

(a) The easy part. Show that if a linear operator R takes ON bases into ON bases 
and is continuously obtainable from the identity, then R e SO(n). It’s immediate 
that R e 0(n ) ; the trick here is showing that det R = 1 . 

(b) The converse. If R e SO(n), then it is immediate that R takes ON bases into ON 
bases. The slightly nontrivial part is showing that R is continuously obtainable 
from the identity. Prove this using induction, as follows: First, show that the 
claim is trivially true for 50(1). Then suppose that the claim is true for n — 1. 
Take R e SO(n) and show that it can be continuously connected (via orthogonal 
similarity transformations) to a matrix of the form 



(4.132) 


The claim then follows since by hypotheses R' can be continuously connected 
to the identity. (Hint: You will need the 2-D rotation which takes e\ into Re\.) 

4.2 In this problem we prove Euler’s theorem that any R e 50(3) has an eigenvec- 
tor with eigenvalue 1 . This means that all vectors i> proportional to this eigenvector 
are invariant under R, i.e. Rv = v, and so R fixes a line in space, known as the axis 
of rotation. 

(a) Show that X being an eigenvalue of R is equivalent to det (R — XI) = 0. Refer 
to Problem 3.5 if necessary. 

(b) Prove Euler’s theorem by showing that 


det(R - I) = 0. 


(4.133) 


Do this using the orthogonality condition and properties of the determinant. You 
should not have to work in components. 
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4.3 Show that the matrix (4.24) is just the component form (in the standard basis) 
of the linear operator 

R( n, 6 ) — L(n) <g> n + cos0(/ — L(n) <g> n) + sinPnx (4.134) 

where (as you should recall) L(v)(w) = ( v . w ) and the last term eats a vector v and 
spits out sin On x v. Show that the first term is just the projection onto the axis of 
rotation n, and that the second and third term just give a counterclockwise rotation 
by 6 in the plane perpendicular to n. Convince yourself that this is exactly what a 
rotation about n should do. 

4.4 Show that SO( 3, 1)„ is a subgroup of <9(3, 1). Remember that 5<9( 3, 1 ) 0 is de- 
fined by 3 conditions: |A| = 1, A 44 > 1, and (4.14). Proceed as follows: 

(a) Show that I e 5(9(3, 1) G . 

(b) Show that if A e 5(9(3. 1) G , then A -1 e 5(9(3, 1)„. Do this as follows: 

(i) Verify that |A _1 | = 1. 

(ii) Show that A -1 satisfies (4.14). Use this to deduce that A r does also. 

(iii) Write out the 44 component of (4.14) for both A and A -1 . You should get 
equations of the form 

flo = 1 + a 2 (4.135) 

bl = 1+b 2 , (4.136) 

where bo = (A - 1 ) 44 . Clearly this implies bo < — 1 or bo > 1. Now, write 
out the 44 component of the equation A A -1 — I . You should find 

fl 0 £>o = 1 — a • b. (4.137) 

If we let a = |a|, b= |b| then the last equation implies 

1 — ab < ciobo < 1 + ab. (4.138) 

Assume bo < —1 and use (4.136) to derive a contradiction to (4.138), 
hence showing that bo — (A -1 ) 44 > 1, and that A~ l e 5(9(3, 1) 0 . 

(c) Show that if A, B e 5(9(3, 1) G , then AB e 5(9(3, 1) 0 . You may have to do some 
inequality manipulating to show that (AB ) 44 > 0. 

4.5 In this problem we prove that any A e 5(9(3, 1) G can be written as a product of 
a rotation and a boost. 

(a) If A is not a pure rotation then there is some relative velocity /3 between the 
standard reference frame and the new one described by A. Use the method of 
Exercise 4.1 1 to find /3 in terms of the components of A. 

(b) Let L be the pure boost of the form (4.29) corresponding to the ft you found 
above. Show that L _1 A is a rotation, by calculating that 

(L~ l A) 44 =l 

(L- 1 A ). 4 = (L- 1 A) 4 . = 0, ( = 1,2,3. 

Conclude that A = L(L~ [ A) is the desired decomposition. 
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4.6 In this problem we show that any A e SL( 2, C) can be decomposed as A = 
LU where U e SU(2) and L is of the form (4.35). Unfortunately, it is a little too 
much work to prove this from scratch, so we will start with the polar decomposition 
theorem , which states that any A e SL( 2, C) can be decomposed as A — HU where 
U e SU( 2) and H is Hermitian and positive , which means that (v, Hv) > 0 for all 
nonzero v e C 2 (here (• | ■) is the standard Hermitian inner product on C 2 ). The 
polar decomposition theorem can be thought of as a higher-dimensional analog of 
the polar form of a complex number, z = re 1 " . You will show below that the set of 
positive, Hermitian H e SL( 2, C) is exactly the set of matrices of the form (4.35). 
This, combined with the polar decomposition theorem, yields the desired result. For 
more on the polar decomposition theorem itself, including a proof, see Hall [8], 


(a) Show that an arbitrary Hermitian 1 x H e SL( 2, C) can be written as 


where 



z 

a — b 


(4.139) 


a,beR, zeC, a 2 - b 2 - |z| 2 = 1. (4.140) 

(b) Show that any numbers a,b,z satisfying (4.140) can be written in the form 


a = ± coshw 


b = v z sinh u 
Z — ( v x — ivy ) sinhu 

for some u e R. and unit vector reR 3 . Then show that positivity requires that 
a = +cosh«. This puts H in the form (4.35). 

(c) It remains to be shown that an H of the form (4.35) is actually positive. To do 
this, we employ a theorem (see Hoffman and Kunze [10]) which states that a 
matrix B is positive if and only if it is Hermitian and all its principal minors 
are positive, where its principal minors A k(B), k <n are the k x k partial de- 
terminants defined by 


A k (B) = 


B it 
Bki 


Bik 

Bkk 


(4.141) 


Show that both the principal minors A i(H) and At)//) are positive, so that H 
is positive. 


4.7 In this problem we find an explicit formula for the map p : SU(2) — > 0(3) of 
Example 4.19 and use it to prove that p maps SU( 2) onto SO( 3) and has kernel ±7. 

(a) Take an arbitrary SU(2 ) matrix A of the form (4.25) and calculate AAA 1 for an 
arbitrary X e su(2) of the form (4.39). 

(b) Decomposing a and /3 into their real and imaginary parts, use (a) to compute 
the column vector [AX A ']. 
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(c) Use (b) to compute the 3x3 matrix p(A). Recall that p(A) is defined by the 
equation p(A)[X ] = [ AX A ’]. 

(d) Parametrize a and p as 

a = e W+*V 2 cos °_ 

2 

p = ie H *~*)/2 sin 9 _ 

2 

as in (4.26). Substitute this into your expression for p(A ) and show that this is 
the transpose (or inverse, by the orthogonality relation) of (4.23). 

(e) Parametrize a and p as in (4.92) by 

a — cos (6/2) — in z sin(0/2) 

P = (— in x — M y ) sin(0/2) 

and substitute this into your expression for p(A) to get (4.24). 

(f) Conclude that detp(A) = 1 and that p maps 57/(2) onto SO( 3). Also use your 
expression from (c) to show that the kernel of p is ±7. (It’s obvious that 
±7 e K . What takes a little calculation is showing that ±7 is all of K .) 

4.8 In this problem we find an explicit formula for the map p : SL( 2, C) — > 0(3, 1) 
of Example 4.20 and use it to prove that p maps 57.(2, C) onto 50(3, 1)„ and has 
kernel ±7. Note that since any A e 57,(2, C) can be decomposed into A — LR with 
R e 50(2) and L of the form (4.35), and any B e 50(3, l) c can be decomposed as 
B — LR' where L is of the form (4.31) and R' e 50(3) C 50(3, 1) G , our task will 
be complete if we can show that p(Z) = L. This we will do as follows: 

(a) To simplify computation, write the matrix (4.35) as 

L=( a t Z? a Z _\ a,b€R, zeC (4.142) 

and calculate LXL' for X e 772(C) of the form (4.42). 

(b) Decomposing z into its real and imaginary parts, compute the column vector 
| /.A'/.']. 

(c) Use (b) to compute the 4x4 matrix p(L). 

(d) Substitute back in the original expressions for a , b and z 

a = coshw/2 
u- 

b = — sinh u /2 
u 

1 

z = — (u x — i «y) sinh u /2 
u 

into p(L) to obtain (4.31). 

4.9 In this problem we will prove the claim from Example 4.22 that, for a given 
permutation a e 5„, the number of transpositions in any decomposition of a is 
either always odd or always even. 
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(a) Consider the polynomial 

p(x 1 , ...,x„) = Y\( Xl _x ;)- 
><j 

For example, for n = 3 and n — 4 this gives 

p(x l , x 2 , x 3 ) = (*i - x 2 )(xi - x 3 )(x 2 - x 3 ) 
p(xi, x 2 , x 3 , x 4 ) = (x 1 - x 2 )(xi - x 3 ) (x 1 - x 4 ) 

X (X 2 - X 3 )(X2 - x 4 )(x 3 - X 4 ). 

Define an action of a e S„ on p by 

(<rp)( X 1, . . . , X„) = p(x a ( 1), . . . , X a („)) = J - [(X(j (/) - XctQ')). 

‘<j 

Convince yourself that op — ip. 

(b) Let r e ,S'„ be a transposition. Prove (or at least convince yourself) that 
rp = -p. 

(c) Now assume that a has a decomposition into an even number of transpositions. 
Use p to prove that a can then never have a decomposition into an odd number 
of transpositions. Use the same logic to show that if a has a decomposition into 
an odd number of transpositions, then all of its decompositions must have an 
odd number of transpositions. 

4.10 Consider the subset H C GL(n, C) consisting of those matrices whose entries 
are real and rational. Show that H is in fact a subgroup, and construct a sequence of 
matrices in H that converge to an invertible matrix with irrational entries (there are 
many ways to do this!). This shows that H is a subgroup of GL(n, C) which is not 
a matrix Lie group. 

4.11 Let G be a Lie group and let X, Y e 0 . Suppose that X and Y are small, so that 
e x e Y is close to the identity and hence has a logarithm computable by the power 
series for In, 

k= 1 

By explicitly expanding out the relevant power series, show that up to third order in 
X and F, 

ln(e x e Y ) = X + Y + ±[X, F] + ±[X, [X, F]] - ^[F, [X, F]] H . (4.144) 

Note that one would actually have to compute higher-order terms to verify that they 
can be written as commutators, and this gets very tedious. A more sophisticated 
proof is needed to show that every term in the series is an iterated commutator and 
hence an element of g. See Varadarajan [17] for such a proof. 
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4.12 Let G be a matrix Lie group. Its Lie algebra g comes equipped with a sym- 
metric (2, 0) tensor known as its Killing Form , denoted K and defined by 

K(X, L) = -Tr(ad x ady). (4.145) 


(a) Show that K is Ad-invariant, in the sense that 

K(Ad A (X), Ad A (L)) = K(X, Y ) WX, Leg, A e G. 

(b) You know from Exercise 4.38 that [ady ] = L, . Use this to compute the compo- 
nents of K in the { Sj } basis and prove that 

[K] = 2I (4.146) 

so that K is positive-definite. This means that K is an inner product on su(2), 
and so from part (a) we conclude that Ad A e lsom(su(2)) ~ 0(3). 


4.13 Prove directly that 

Ad elX =e ,adx (4.147) 

by induction, as follows: first verify that the terms first order in t on either side 
are equal. Then, assume that the nth order terms are equal (where n is an arbitrary 
integer), and use this to prove that the n + 1th order terms are equal. Induction then 
shows that the terms of every order are equal, and so (4.147) is proven. 


4.14 Prove the formula e 1 ' x = dete x as follows: Consider the determinant func- 
tion as a homomorphism 

det : GL(n , C) -* C* 

A i — > det A. 


This induces a Lie algebra homomorphism 


<p : g[(n, C) — C. (4.148) 

We will show that <p(A) — TrA, which will imply e TrX = dete^ by (4.116). By 
definition, we have 


0(*)=^ det(e rx ) 


(4.149) 


To evaluate this, expand the exponential to first order in t, plug into the determinant 
using the formula (3.84), and expand this to first order in t using properties of the 
determinant. This should yield the desired result. 



Chapter 5 

Basic Representation Theory 


Now that we are familiar with groups and, in particular, the various transformation 
groups (i.e. matrix Lie groups) that arise in physics, we are ready to look at objects 
that ‘transform’ in specific ways under the action of these groups. The notion of an 
object ‘transforming’ in a specific way is made precise by the mathematical notion 
of a representation , which is essentially just a way of representing the elements of 
a group or Lie algebra as operators on a vector space (the objects which ‘transform’ 
are then just elements of the vector space). Representations are important in both 
classical and quantum physics: In classical physics, they clarify what we mean by 
a particular object’s ‘transformation properties’. In quantum mechanics, they both 
provide the basic mathematical framework (the Lie algebra of observables g acts on 
the Hilbert space at hand, so that the Hilbert space furnishes a representation of g) 
as well as clarify the notion of ‘vector’ and ‘tensor’ operators, which are usually 
introduced in a somewhat ad hoc way (much as we did in Sect. 3.7!). Furthermore, 
representation theory allows us to easily and generally prove many of the quantum 
mechanical ‘selection rules’ that are so handy in computation. 

As with our discussion of tensors, we will treat mainly finite-dimensional vector 
spaces here but will occasionally be interested in infinite-dimensional applications. 
Rigorous treatment of these applications can be subtle and technical, though, so as 
before we will extend our results to certain infinite-dimensional cases without ad- 
dressing the issues related to infinite-dimensionality. Again, you should be assured 
that all such applications are legitimate and can, in theory, be justified. 


5.1 Representations: Definitions and Basic Examples 

A representation of a group G is a vector space V together with a group homomor- 
phism n : G —*■ GL(V). Sometimes they are written as a pair (n, V ), though occa- 
sionally when the homomorphism n is understood we will just talk about V , which 
is known as the representation space. If V is a real vector space then we say that 
(n, V ) is ureal representation, and similarly if V is a complex vector space. We can 
think of the operators n(g) (which we will sometimes write as n ? ) as ‘represent- 
ing’ the elements of G as invertible linear operators on V. If G is a matrix Lie group, 
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and V is finite-dimensional, and the group homomorphism IT : G — > GL{V) is con- 
tinuous, then n induces a Lie algebra homomorphism ir : g — > g[(L) by (4.114). 
Any homomorphism from g to gl(V) for some V is known as a Lie algebra rep- 
resentation, so every finite-dimensional representation of a Lie group G induces 
a representation of the corresponding Lie algebra g. The converse is not true, how- 
ever; not every representation of g comes from a corresponding representation of G. 
This is intimately connected with the fact that for a given Lie algebra g, there is no 
unique matrix Lie group G that one can associate to it. We will discuss this in detail 
in the case of su(2) and so(3, 1) later. 

In many of our physical applications the vector space V will come equipped 
with an inner product (• | •) which is preserved by the operators n „ (in other words, 
IT (g) e Isom(V) Vy e G). In this case, we say that II is a unitary representation, 
since each 11(g) will be a unitary operator (cf. Example 4.5). If it is the induced Lie 
algebra representation, then XX <= g, we have 

(u|u>) = {n{e ,x )v\n{e ,x )w) Vv, w e V, t e R. (5.1) 

Employing our standard trick of differentiating with respect to t and evaluating at 
t — 0 gives 

0 = (jt(X)v\w) + (y\n(X)w) 

= (n(X)v\ w) + (tc(X)^v\w) 

— ([^(Z) + ^r(Z) t ]u|ui) 
n(X)+Tt{Xyt = 0 

so 7 r(X) is anti-Hermitian for all X\ Thus, if 


then 


II : G -* Isom(y), 


7t : g — > isom(V) (5.2) 

as the notation suggests. Any Lie algebra homomorphism n : g — > isotul V ) is called 
a unitary representation of g, regardless of whether or not it comes from a unitary 
representation of G. 

One of the reasons unitary representations are so common in physics is that in 
quantum mechanics, states are represented by unit vectors and so any linear oper- 
ator T that takes a state into another state must preserve norms. This implies (as 
can be shown via an argument almost identical to the one used in Exercise 4.19) 
that T must be unitary, and so virtually all representations of interest in quantum 
mechanics are unitary. 

Example 5.1 The trivial representation 

As the name suggests, this example will be somewhat trivial, though we will end up 
referring to it later. For any group G (matrix or discrete) and vector space V, define 
the trivial representation of G on V by 
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n(g) = / VgeG. 


(5.3) 


You will verify below that this is a representation. Suppose in addition that G is 
a matrix Lie group. What is the Lie algebra representation induced by n? For all 
Xegwe have 



d 

— 1 
dt 


— 0 


so the trivial representation of a Lie algebra is given by jt(X) = 0 WX e g. 

Exercise 5.1 Let our representation space be V = C. Show that GL( C) ~ GL(1, C) ~ C* 
where C* is the group of nonzero complex numbers. Then verify that n : G —> C* as 
defined above is a group homomorphism, hence a representation. Also verify that tt : g — > 
gl(C) — C given by tt(X) = 0 VX is a Lie algebra representation. 

Example 5.2 The fundamental representation 

Let G be a matrix Lie group. By definition, G is a subset of GL(n, C ) = GL(C n ) 
for some n, so we can consider the obvious representation just given by interpreting 
the elements of G as operators (acting by matrix multiplication) on V — C" . This is 
known as the fundamental (ox standard) representation of G. If G — 0(3) or SO( 3) 
then V = K 3 and the fundamental representation is known as the vector represen- 
tation. If G = SU( 2) then V = C 2 and the fundamental representation is known 
as the spinor representation. If G = SO(3, 1) G or 0(3, 1) then V — R 4 , and the 
fundamental representation is also known as the vector (or sometimes four-vector) 
representation. If G — SL( 2, C) then V — C 2 , and the fundamental representation is 
also known as the spinor representation. Vectors in this last representation are some- 
times referred to more specifically as left-handed spinors, and are used to describe 
massless relativistic spin 1/2 particles like the neutrino. 1 

Each of these group representations induces a representation of the correspond- 
ing Lie algebra which then goes by the same name, and which is also given just by 
interpreting the elements of g C gl (n, C ) as linear operators. Since Lie algebras are 
vector spaces and a representation ji is a linear map, we can describe any Lie alge- 
bra representation completely just by giving the image of the basis vectors under n 
(this is one of the nice features of Lie algebra representations; they are much easier 
to concretely visualize). Thus, the vector representation of so(3) is given by 


'There is, of course, such a thing as a right-handed spinor as well, which we will meet in the next 
section and which is also used to describe massless spin 1 /2 particles. The right and left-handed 
spinors are known collectively as Weyl spinors, in contrast to the Dirac spinors, which are used to 
describe massive spin 1 /2 particles. We shall discuss Dirac spinors towards the end of this chapter. 
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n(L x ) = 

Jt(Ly) = 

n(L z ) 

where, of course, n is just the identity, 
is given by 

Jr (S x ) = 

7T(Sy) = 

n(S z ) = 



Likewise, the spinor representation of su(2) 


0 


-i 0 
0 i 


and similarly for the vector representation of so(3, 1) and the spinor representation 
of sl(2, C)r. 


Exercise 5.2 Show that the fundamental representations of SO(3). 0(3), and 5(7(2) are 
unitary. (The fundamental representations of S0( 3, 1) D , 0(3, 1), and SL(2, C) are not uni- 
tary. which can be guessed from the fact that the matrices in these groups are not unitary 
matrices. This stems from the fact that these groups preserve the Minkowski metric, which 
is not an inner product.) 


Example 5.3 The adjoint representation 


A less trivial class of examples is given by the Ad homomorphism of Example 4.39. 
Recall that Ad is a map from G to GL(g), where the operator Ad A (for ,4 e G ) is 
defined by 

Ad A (X) = AAA -1 , A eg. 

In the context of representation theory, the Ad homomorphism is known as the ad- 
joint representation (Ad, g). Note that the vector space of the adjoint representation 
is just the Lie algebra of G! The adjoint representation is thus quite a natural con- 
struction, and is ubiquitous in representation theory (and elsewhere!) for that reason. 
To get a handle on what the adjoint representation looks like for some of the groups 
we have been working with, we consider the corresponding Lie algebra representa- 
tion (ad, g), which you will recall acts as 

ad z (T) = [X, Y ], X,Y eg. 

For so(3) with basis B = {L/},=i_ 3, you have already calculated in Exercise 4.38 
(using the isomorphic su(2) with basis { 5 1 / }/ = i 3) that 



5.1 Representations: Definitions and Basic Examples 


149 


[ a dL;]e = L, 

so for so (3) the adjoint representation and fundamental representation are identical! 
(Note that we did not have to choose a basis when describing the fundamental rep- 
resentation because the use of the standard basis there is implicit.) Does this mean 
that the adjoint representations of the corresponding groups 50(3) and 0(3) are 
also identical to the vector representation? Not quite. The adjoint representation of 
50( 3) is identical to the vector representation (as we will show), but that does not 
carry over to 0(3); for 0(3), the inversion transformation —I acts as minus the 
identity in the vector representation, but in the adjoint representation acts as 

Ad _/(X) = (-/)*(-/) = X (5.4) 

so Ad_/ is the identity! Thus the vector and adjoint representations of 0(3), though 
similar, are not identical, and so the adjoint representation is known as the pseu- 
dovector representation.- This will be discussed further in the next section. 

What about the adjoint representations of SU(2) and su(2)? Well, we already 
met these representations in Examples 4.19 and 4.37, and since su(2) ~ so(3) and 
the adjoint representation of so(3) is the vector representation, the adjoint represen- 
tations of both SU( 2) and su(2) are also known as their vector representations. 

As for the adjoint representations of 50(3, 1) G and 0(3, 1), it is again useful to 
consider first the adjoint representation of their common Lie algebra, so (3, 1). The 
vector space here is so(3, 1) itself, which is six-dimensional and spanned by the 
basis B — { L ; , A',},- ;=i_ 3 . You will compute in Exercise 5.3 below that the matrix 
forms of ad,- and ad a- are (in 3 x 3 block matrix form) 

L'i 1 

[ad D=(o “,) < 5 - 5) 

[“**] = (_°i, “o'")' (56) 

From this we see that the Lj and Kj both transform like vectors under rotations 
( ad ), but are mixed under boosts (ad a-, ). This is reminiscent of the behavior of the 
electric and magnetic field vectors, and it turns out (as we will see in Sect. 5.4) that 
the action of so(3, 1) acting on itself via the adjoint representation is identical to the 
action of Lorentz transformation generators on the antisymmetric field tensor F llv 
from Example 3.16. The adjoint representation for so(3, 1) is thus also known as the 
antisymmetric second rank tensor representation, as are the adjoint representations 
of the corresponding groups 50(3, 1) 0 and 0(3, 1). We omit a discussion of the 
adjoint representation of SL(2, C) for technical reasons. 2 3 □ 


2 You shouldn't be surprised at the nomenclature here, since we saw in Example 3.29 that pseu- 
dovectors are essentially 3x3 antisymmetric matrices, which are exactly so (3)! 

^Namely, that the vector space in question, sl(2, C)r, is usually regarded as a three-dimensional 
complex vector space in the literature, not as a six-dimensional real vector space (which is the 
viewpoint of interest for us), so to avoid confusion we omit this topic. This will not affect any 
discussions of physical applications. 
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Exercise 5.3 Verify (5.5) and (5.6). 

At this point your head may be swimming with nomenclature; the following table 
should help: 


Group 

Fundamental rep 

Adjoint rep 

50(3) 

vector 

vector 

0(3) 

vector 

pseudovector 

51/ (2) 

spinor 

vector 

50(3, l) c 

vector 

antisymmetric rank 2 tensor 

0(3,1) 

vector 

antisymmetric rank 2 tensor 

SL( 2, C) 

Weyl spinor 

- 


5.2 Further Examples 

The fundamental and adjoint representations of a matrix Lie group are the most ba- 
sic examples of representations and are the ones out of which most others can be 
built, as we will see in the next section. There are, however, a few other represen- 
tations of matrix Lie groups that you are probably already familiar with, as well 
as a few representations of abstract Lie algebras and discrete groups that are worth 
discussing. We will discuss these in this section, and return in the next section to 
developing the general theory. 

Example 5.4 Representations of Z 2 

The notion of representations is useful not just for matrix Lie groups, but for more 
general groups as well. Consider the finite group 7*2 = {1,-1}. For any vector 
space V , we can define the alternating representation (TI a i t , V) of Zt by 


rialt(l) = I 

n alt (— i) = -i. 

This, along with the trivial representation, allows us to succinctly distinguish 
between the vector and pseudovector representations of 0(3) 4 ; when restricted 
to Z 2 — {/,—/} C 0(3), the fundamental (vector) representation of 0(3) be- 
comes n alt , whereas the adjoint (pseudovector) representation becomes n t rivi a i. 

Another place where these representations of Z 2 crop up is the theory of identical 
particles. Recall from Example 4.22 that there is a homomorphism sgn : S n — »■ Z 2 
which tells us whether a given permutation is even or odd. If we then compose this 
map with either of the two representations introduced above, we get two representa- 
tions of S„ : n a it o sgn, which is known as the sgn (read: ‘sign’) representation of S n , 


4 And, as we will see, between the vector and pseudovector representations of 0(3, 1). 
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and n trivial ° sgn, which is just the trivial representation of 5 „ . If we consider an n- 
particle system with Hilbert space then from Example 4.22 we know that 

S n {H) C Tp(H) furnishes the trivial representation of 5„, whereas A"H C Tff(H) 
furnishes the sgn representation of S„ . This allows us to restate the symmetrization 
postulate once again, in its arguably most succinct form: For a system composed of 
n identical particles, any state of the system lives either in the trivial representation 
of S„ {in which case the particles are known as bosons) or in the sgn representation 
of S n {in which case the particles are known as fermions). 

Example 5.5 The four-vector representation of SL{ 2, C) 

Recall from Example 4.20 that SL{ 2, C) acts on // 2 (C) by sending X —> AX A' , 
where X e // 2 (C) and A e SL{ 2, C). It is easy to see that this actually defines a 
representation (n, // 2 (C)) given by 

n {A){X) = AXA^. (5.7) 

We already saw that if we take B — {er v , er v ,ov, /} as a basis for // 2 (C), then 
[11(A)] e 5(9(3. 1), so in this basis the action of n(A) looks like the action of re- 
stricted Lorentz transformations on four- vectors. Hence (n, // 2 (C)) is also known 
as the four-vector representation of SL{2 , C). 

Example 5.6 The right-handed spinor representation of 57.(2, C) 

As one might expect, the right-handed spinor representation (TT , C 2 ) of 57.(2, C) is 
closely related to the fundamental (left-handed spinor) representation. It is defined 
simply by taking the adjoint inverse of the fundamental representation, that is, 

fl(A)u = A^u, A e SL{2, C), veC 2 . (5.8) 

You can easily check that this defines a bona fide representation of SL{2, C). The 
usual four component Dirac spinor can be thought of as a kind of ‘sum’ of a left- 
handed spinor and a right-handed one, as we will discuss later on. □ 

The next few examples are instances of a general class of representations that are 
worth describing briefly. Say we are given a finite-dimensional vector space V, a 
representation n of G on V, and a possibly infinite-dimensional vector space C ( V ) 
of functions on V (this could be, for instance, the set Pi{V) of all polynomial func- 
tions of a fixed degree /, or the set of all infinitely differentiable complex-valued 
functions C{V), or the set of all square-integrable functions L 2 (V )). Then the rep- 
resentation n on V induces a representation n on C ( V ) as follows: if / e C (V ), 
then the function Yl g f e C{V) is given by 

(n*/)(i0 = /(n-'u), g e G, f e C(V), V e V. (5.9) 

There are a couple things to check. First, it must be verified that Id g f is actually 
an element of C{V); this, of course, depends on the exact nature of C ( V ) and of n 
and must be checked independently for each example. Assuming this is true, we also 
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need to verify that 11 (g) is a linear operator and that it satisfies tl(gh) = n(g)n(/t). 
This computation is the same for all such examples, and you will perform it in 
Exercise 5.4 below. 

Exercise 5.4 Confirm that if (I~I, V) is a representation of G, then fig is a linear operator 
on C(V), and that n(gi)n(g 2 ) = ^i(glgl)- What happens if you try to define n using g 
instead of g _1 ? 

Example 5.7 The spin s representation of su(2) and polynomials on C 2 

We already know that the Hilbert space corresponding to a spin s particle fixed in 
space is C 2s+1 . Since the spin angular momentum S is an observable for this sys- 
tem and the S, have the su(2) structure constants, C 2i+1 must be a representation 
of su(2). How is this representation defined? Usually one answers this question by 
considering the eigenvalues of S z and showing that for any finite-dimensional rep- 
resentation V, the eigenvalues of S- lie between — s and s for some half-integral .v . 
V is then defined to be the span of the eigenvectors of S- (from which we conclude 
that dim V = 2s + 1), and the action of S x and S v is determined by the su(2) com- 
mutation relations. This construction is important and will be presented in Sect. 5.8, 
but it is also rather abstract; for the time being we present an alternate construction 
which is more concrete. 

Consider the set of all degree / polynomials on C 2 , i.e. the set of all degree / 
polynomials in the two complex variables z i and Z2- This is a complex vector space, 
denoted P/( C 2 ), and has basis 


Bi = {z[ k z\\ 0<k<l} = {z[,z[ 1 zi, ■ ■ ■ , z\z l 2 \4} 


and hence dimension / + 1 . The fundamental representation of SU( 2) on C 2 then in- 
duces an SU( 2) representation (11/ , Pj (C 2 )) as described above: given a polynomial 
function p e P/(C 2 ) and A e SU( 2), n/(A)(p) is the degree / polynomial given by 

(n/(A)(p))(u) = p(A -1 i>), v e C 2 . 

To make this representation concrete, consider the degree one polynomial 
p(zi, Z2) = Zi- This polynomial function just picks off the first coordinate of any 
v e C 2 . Let 



so that 



Then 


(Hi (A)p)(u) = p(A 1 u) 


but 
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so then 


A-V 



( ot_Z] - fiz2\ 
\Pzi+UZ 2 ) 


(5.10) 


ni(A)si = az\ — Pzi- 

Likewise, (5.10) tells us that 


rii(A)42 = fz\ + OLZ2- 


From this, the action of 11/ (A) on higher order polynomials can be determined since 

n ,(A)(z[- k z k 2 ) = (n l (A)ziy- k (n l (A)z 2 ) k . 

We can then consider the induced Lie algebra representation ( 7 r/, /) (C 2 )), in 
which (as you will show) z , f k z\ is an eigenvector of S z with eigenvalue i(j — k), 
so that 


( 1/2 


[S-Jb, = 1 


1 / 2 - 1 


-1/2/ 


(5.11) 


If we let s = l /2 then we recognize this as the usual form (up to that pesky factor 
of i) of S z acting on the Hilbert space of a spin s particle. □ 


Exercise 5.5 Use the definition of induced Lie algebra representations and the explicit form 
of e' s ‘ in the fundamental representation (which can be deduced from (4.92)) to compute 
the action of the operators tti(S;) on the functions z 1 and Z 2 in FAC 2 ). Show that we can 
write these operators in differential form as 

n i ( 9 3 \ 

’' ,s "-2( l, zr, +z, zrJ 

1/3 3 \ 

ni{S2)= -l\ Z2 ^[- Zl ^) (5 ' 12) 

i ( 3 3 \ 

Wl(S3) = 2(^- Z2 a^J- 

Prove that these expressions also hold for the operators 7Zi(S;) on P/(C 2 ). Verify the su(2) 
commutation relations directly from these expressions. Finally, use (5.12) to show that 

{m(s z )){z[~ k z k 2 ) = w / 2 - k) z [- k z k 2 . 

Example 5.8 L 2 (R 3 ) as a representation of SO (3) 

Recall from Example 3.18 that L 2 (R 3 ) is the set of all complex- valued square- 
integrable functions on R 3 , and is physically interpreted as the Hilbert space of a 
spinless particle moving in three dimensions. As in the previous example, the funda- 
mental representation of 50(3) on R 3 induces an 50(3) representation (n, L 2 (R 3 )) 
by 


(n*(/))(x) = f e L 2 (R 3 ), R e 50(3), x e R 3 . (5.13) 
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One can think of Yl(R)f as just a “rotated” version of the function /. This repre- 
sentation is unitary, as we will now digress for a moment to show. 

Let (• | ■) denote the Hilbert space inner product on L 2 (R 3 ). To show that n is 
unitary, we need to show that 


(n(*)/|II(J?)g) = (/|s) V/, g e L 2 (R 3 ). (5.14) 

First off, we have 

(n(R)f\n(R)g) = I d 3 x (n R f)(x)(n Rg )(x) 

= J d 3 x f(R~ l x)g(R~ l x). 

If we now change variables to x' = R~ l x and remember to include the Jacobian, we 
get 


(U(R)f\Tl(R)g) 


f 

/ 


d 3 x' 


d(x, y, z) 
d(x', y', z') 


d 3 x' f(x!)g(x!) 


/(x')g(x') 


= if\g) 

where you will verify below that the Jacobian determinant | g^ ,3 ’/'; ) , ) | is equal to one. 
This should be no surprise; the Jacobian tells us how volumes change under a change 
of variables, but since in this case the change of variables is given just by a rotation 
(which we know preserves volumes), we should expect the Jacobian to be one. 

Though the definition of n might look strange, it is actually the action of ro- 
tations on position kets that we are familiar with; if we act on the basis ‘vector’ 
|xq) = <5(x — xq), we find that (for arbitrary ijr e L 2 (R 3 )) 


(Vf|n s |x 0 ) = J d 3 x\//(x)S(R 1 x x 0 ) 

= J d 3 x' \j/{Rx!)&(x! — xo) where we let x' = R~ l x 
= lA(tfxo) 

hence we must have 


n s |x 0 > = | Rx q ) 

which is the familiar action of rotations on position kets. 

What does the corresponding representation of so (3) look like? 5 As mentioned 
above, we can get a handle on that just by computing n(Li), since jt is a linear map 


5 We should mention here that the infinite-dimensionality of L 2 (R 3 ) makes a proper treatment of 
the induced Lie algebra representation quite subtle; for instance, we calculate in this example that 
the elements of so(3) are to be represented by differential operators, yet not all functions in L 2 (R 3 ) 
are differentiable! (Just think of a step function which is equal to 1 inside the unit sphere and 0 
outside the unit sphere; this function is not differentiable at r = 1.) In this example and elsewhere, 
we ignore such subtleties. 
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and any X e so(3) can just be expressed as a linear combination of the L , . Hence 
we compute 


(ttl, /)(X) = — (n e tLi /)(x) 


by the def. of tx 


t= o 


= -f(e~ rLi x j 
dt JK ’ 


by the def. of n 


r=0 


A 9 f d , \j 

- > — (x) — (e ,L 'xy 
^ dxJ dt y ’ 


7 = 1 
3 


by the multivariable chain rule 


f=0 




7 = 1 


dx J 


~ X! a../ ( x )( L i)jkX k 


j,k=l 


dxJ 


0 jr 

— ^ ei jk x k j^j(x) since (Li) j k = -€ij k 
j,k=l 


so that, relabeling dummy indices, 


7T 


/r 3 

(Li) - “ 2 ^ e Lk x 

j,7-l 


More concretely, we have 


n(L x ) 


7T(Ly) 


3 3 

Z 3y _ ' V 3z 
3 3 


3z 3x 


?r(Xz) = y— 
d.T 



which, up to our usual factor of i . is just (2.11)! 


Exercise 5.6 Verify that if x' = R l x for some rotation R e 50(3), then 

S(x,y,z) _ 

3(x', /, z') 


(5.15) 


□ 


(5.16) 


Example 5.9 ?f/(R 3 ), fi; and L 2 (S 2 ) as representations of S0( 3) 

Recall from Chap. 2 that 'H/OSf ) is the vector space of all harmonic complex-valued 
degree / polynomials on R 3 . Since Hi(M. 3 ) is a space of functions on R 3 , we get an 
50(3) representation of the same form as in the last example, namely 
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(n ,(RXf))(±) = /(R- 1 *), feH,( R 3 ), ReSO( 3), xeR 3 . 

You will check below that if / is a harmonic polynomial of degree /, then Tli(R)f 
is too, so that 11/ really is a representation on Hi (R 3 ). The induced so(3) represen- 
tation also has the same form as in the previous example. 

If we now restrict all the functions in Hi (R 3 ) to the unit sphere, we get a repre- 
sentation of SO( 3) on Hi, the space of spherical harmonics of degree /. Concretely, 
we can describe this representation by writing Y(d,(p) as Y(n), where n is a unit 
vector giving the point on the sphere which corresponds to ( 6,(p ). Then the .5(9(3) 
representation (Tl,Hi) is given simply by 


(fi/(/?)y)(n) = r(/r 1 n). (5.17) 

The interesting thing about this representation is that it also turns out to be unitary! 
The inner product in this case is just given by integration over the sphere with the 
usual area form, i.e. 

f*7 r /»2 7T 

(Fi(6>, <p)\Y 2 (.6, (/)))= / / YHO, <j>) Y 2 {9, cj))sm6d(t>d0. (5.18) 

Jo Jo 

Proving that (5.17) is unitary with respect to this inner product is straightforward 
but tedious, so we omit the calculation. 6 

One nice thing about this inner product, though, is that we can use it to define a 
notion of square-integrability just as we did for R and R 3 : we say that a C-valued 
function Y(d,(p) on the sphere is square-integrable if 


(Y (9, (p)\Y(e, (/)))= f f \Y (6 , cf>)\ 2 sin 6 d(f> d 6 < oo. (5.19) 

Jo Jo 


lust as with square-integrable functions on R. the set of all such functions forms a 
Hilbert space, usually denoted as L 2 (S 2 ), where S 2 denotes the (two-dimensional) 
unit sphere in R 3 . It is easy to see 7 that each Hi C L 2 (S 2 ), and in fact it turns 
out that all the Hi taken together are actually equal to L 2 (S 2 )\ We will dis- 
cuss this further in Sect. 5.6, but for now we note that this implies that the set 
[Y l m | 0 < / < oo, — l < m < 1} of all the spherical harmonics form an (orthonormal) 
basis for L 2 (S 2 ). This can be thought of as a consequence of the spectral theorems 
of functional analysis, crucial to quantum mechanics, which tell us that, under suit- 
able hypotheses, the eigenfunctions of a self-adjoint linear operator (in this case the 
spherical Laplacian A S 2 ) form an orthonormal basis for the Hilbert space on which 
it acts. □ 


6 Just as in the previous example, however, one can define transformed coordinates 6' and </>' and 
then unitarity hinges on the Jacobian determinant | | being equal to one, which it is because 

rotations preserve area on the sphere. 

7 Each Y £ Hi is the restriction of a polynomial to S 2 and is hence continuous, hence |T| 2 must 
have a finite maximum M g R. This implies 


(Y\Y) < AnM < oo. 


(5.20) 
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Exercise 5.7 Let / e 77/ (R 3 ). Convince yourself that is also a degree l polynomial, 
and then use the chain rule to show that it is harmonic, hence an element of 77/(R 3 ). The 
orthogonality of R should be crucial in your calculation ! 

Exercise 5.8 If you have never done so, find the induced so (3) representation if/ on 77/ by 
expressing (5.15) in spherical coordinates. You should get 


3 3 

jr[(L x ) = sii\cp 1- cot 9 cos <p — 

3 9 dcp 

3 o 

7T/(L V ) = — cos cp h cot (9 sm cp - — 


3(9 


dcp 


ni(L z ) = - 


3 

dcp' 


Check directly that these satisfy the so(3) commutation relations, as they should. 


The representations we have discussed so far have primarily been representations 
of matrix Lie groups and their associated Lie algebras. As we mentioned earlier, 
though, there are abstract Lie algebras which have physically relevant representa- 
tions too. We will meet a couple of those now. 


Example 5.10 The Heisenberg algebra acting on L 2 (W) 


We have mentioned this representation a few times already in this text, but we dis- 
cuss it here to formalize it and place it in its proper context. The Heisenberg algebra 
H = Span{<7, p, 1} C C(R 2 ) has a unitary Lie algebra representation jr on L 2 (R) 
given by 

(itq (/))(*) = ixf(x) 

(M/))(*) = -^<*) (5.21) 

(tfi (/))(*) = if(x). 

To verify that n is indeed a Lie algebra representation, one needs only to verify 
that the one non-trivial bracket is preserved, i.e. that [n(q), rc( p)\ = 7t([q , p\). This 
should be a familiar fact by now, and is readily verified if it is not. Showing that 
it is unitary requires a little bit of calculation, which you will perform below. The 
factors of i appearing above, especially the one in (5.21), may look funny; you 
should keep in mind, though, that the absence of these factors of i in the physics 
literature is again an artifact of the physicist’s convention in defining Lie algebras, 
and that these factors of i are crucial for ensuring that the above operators are anti- 
Hermitian, as will be seen below. Note that the usual physics notation for these 
operators is q = 7t q — n(q) and p = n p = : r(p). 

Exercise 5.9 Verify that n(q), n(p), and 7r(l) are all anti-Hermitian operators with respect 
to the usual inner product on L 2 (R). Exponentiate these operators to find, for t,a,9 el, 

(«*<»>/)(*) = «''*/(*) 

(e a ^f)(x) = f(x-a) 

(e e *Wf)(x) = e w f(x) 
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and conclude, as we saw (in part) in Example 4.34, that n(q) generates translations in 
momentum space, n(p) generates translations in x, and jt( 1) generates multiplication by a 
phase factor. 

Example 5.11 The adjoint representation of C{P) 


Recall from Example 4.36 that for any Lie algebra g, regardless of whether or not it 
is the Lie algebra of a matrix Lie group, there is a Lie algebra homomorphism 


ad : g 


0 [(g) 


X adx 

and hence a representation of g on itself. Suppose that g = C(R 6 ), the Lie algebra of 
observables on the phase space R (l which corresponds to a single particle living in 
three-dimensional space. What does the adjoint representation of C(R 6 ) look like? 
Since C(R 6 ) is infinite-dimensional, computing the matrix representations of basis 
elements is not really feasible. Instead, we pick a few important elements of C(R 6 ) 
and determine how they act on C(K 6 ) as linear differential operators. First, consider 
ad 9i e gl(C(M 6 )). We have, for arbitrary / e C(R 6 ), 


ad qi f = {qi,f} = 


9/ 

dpi 


(5.22) 


and so just as — ^ generates translation in the x -direction, ad 9i = generates 
translation in the p, direction in phase space. Similarly, you can calculate that 


adp; = — t— (5.23) 

dqi 

so that ad /)( generates translation in the q, direction. If / e C ( R/’ ) depends only on 
the q i and not the p, , then one can show that 


ad z. 3 / = ~ 


9/ 

3 (p 


(5.24) 


where (/> = tan -1 (q 2 /q\) is the azimuthal angle, so that L 3 generates rotations 
around the z-axis. Finally, for arbitrary f e C(R 6 ), one can show using Hamilton’s 
equations 8 that 

ad H = ~ (5.25) 

dt 

so that the Hamiltonian generates time translations. These facts are all part of the 
Poisson Bracket formulation of classical mechanics, and it is from this formal- 
ism that quantum mechanics gets the notion that the various ‘symmetry generators’ 
(which are of course just elements of the Lie algebra of the symmetry group G in 
question) that act on a Hilbert space should correspond to physical observables. 


Exercise 5.10 Verify (5.23) — (5.25). If you did Exercise 4.30, you only need to verify (5.25), 
for which you will need Hamilton’s equations, = pjj , | jr = 


See Goldstein [6]. 
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The next step in our study of representations is to learn how to take tensor products 
of representations. This is important for several reasons: First, as we will soon see, 
almost all representations of interest can be viewed as tensor products of other, more 
basic representations. Second, tensor products are ubiquitous in quantum mechanics 
(since they represent the addition of degrees of freedom), so we better know how 
they interact with representations. Finally, tensors can be understood as elements of 
tensor product spaces (cf. Sect. 3.5) and so understanding the tensor product of rep- 
resentations will allow us to understand more fully what is meant by the statement 
that a particular object ‘transforms like a tensor’. 

Suppose, then, that we have two representations (TT i , V \ ) and (IT 2 , V 2 ) of a 
group G. Then we can define their tensor product representation ( IT 1 0 n 2 , V) 0 
V 2 ) by 


(n, ® n 2 )(g) = n x (g) ® n 2 (g) e av x ® v 2 ). 


(5.26) 


where you should recall from (3.69) how Ili(g) ® n 2 (g) acts on Vj ® V 2 . It is 
straightforward to check that this really does define a representation of G. If G is a 
matrix Lie group, we can then calculate the corresponding Lie algebra representa- 
tion: 


(n 1 ® n 2 )(X) 


r = 0 


= lim 

0 

= lim 

h-> 0 

= lim 
h-> 0 


n { {e hx )®n 2 {e nA )- I ®I 


hX\ 


h 


Tli(e hx ) 0 n 2 (e hx ) - I 0 Y\ 2 {e nx ) + / 0 U 2 {e hx ) -70/ 


n hx 


,hX\ 


„hX s 


(Th(e hx )-I) 


) 


lim 

/ 1 — »-o 


iU 2 ie hx )-I) 


n\ ( X ) ®/ + 7® n 2 (X) 


(5.27) 


where in the second-to-last line we used the bilinearity of the tensor product. If we 
think of Tti as a sort of ‘derivative’ of IT;, then one can think of (5.27) as a kind of 
product rule. In fact, the above calculation is totally analogous to the proof of the 
product rule from single-variable calculus! It is a nice exercise to directly verify that 
7T 1 ® 7r 2 is a Lie algebra representation; this is Exercise 5.11 below. 


Exercise 5.11 Verify that (5.27) defines a Lie algebra representation. Mainly, this consists 
of verifying that 


[in 1 ®n 2 )(X), (jti ® 7T 2 )(L)] = (jti ® n 2 )([X, 7]). 

You may find the form of (5.27) familiar from the discussion below (3.69), as 
well as from other quantum mechanics texts; we are now in a position to explain 
this connection, as well as clarify what is meant by the terms “additive” and “multi- 
plicative” quantum numbers. 
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Example 5.12 Quantum mechanics, tensor product representations, and additive 
and multiplicative quantum numbers 

We have already discussed how in quantum mechanics one adds degrees of free- 
dom by taking tensor products of Hilbert spaces. We have also discussed (in Exam- 
ple 4.33) how a matrix Lie group of symmetries of a physical system (i.e. a matrix 
Lie group G that acts on the phase space P and preserves the Hamiltonian H) gives 
rise to a Lie algebra of observables isomorphic to its own Lie algebra g, and how 
the quantum mechanical Hilbert space associated to that system should be a rep- 
resentation of g. Thus, if we have a composite physical system represented by a 
Hilbert space H = H\ ® Hi, and if the Ti, carry representations jr, of some matrix 
Lie group of symmetries G, then it is natural to take as an additional axiom that G 
is represented on H by the tensor product representation (5.26), which induces the 
representation (5.27) of g on H. 

For example, let G be the group of rotations SO( 3), and let TL\ = L 2 (R 3 ) cor- 
respond to the spatial degrees of freedom of a particle of spin s and Hi — C 2,v+I , 
2s eZ correspond to the spin degree of freedom. Then so (3) is represented on the 
total space H = L 2 (R 3 ) <g> C 2i+1 by 


(jr ® jt s )(Lj) = jz(Lj) ® / + / 0 Jt s (Li) 


(5.28) 


where j r is the representation of Example 5.8 and tt s is the spin s representation 
from Example 5.7. If we identify (re ® 7t s )(L,) with J,-, the / th component of the 
total angular momentum operator, and i r(L ; ) with L the /th component of the 
orbital angular momentum operator, and jr s (L/) with S, , the /th component of the 
spin angular momentum operator, then (5.28) is just the component form of 


J = L®/ + /®S, 


(5.29) 


the familiar equation expressing the total angular momentum as the sum of the spin 
and orbital angular momentum. We thus see that the form of this equation, which 
we were not in a position to understand (mathematically) when we first discussed it 
in Example 3.20, is dictated by representation theory, and in particular by the form 
(5.27) of the induced representation of a Lie algebra on a tensor product space. 
The same is true for other symmetry generators, like the translation generator p in 
the Heisenberg algebra. If we have two particles in one dimension with correspond- 
ing Hilbert spaces Hj, i = 1,2, along with representations jr, of the Heisenberg 
algebra, then the representation of p on the total space H = H\_(& Hi is just 

(jt j 0 7 r 2 )ip) = Ttlip) 0 1 + I 0 TXiip) = pi 0 / + / 0 pi (5.30) 

where p, = tc, (p). This expresses the fact that the total momentum is just the sum 
of the momenta of the individual particles ! 

More generally, (5.27) can be seen as the mathematical expression of the fact that 
physical observables corresponding to generators in the Lie algebra are addi- 
tive. More precisely, we have the following: let i>,- e Hi, i = 1, 2 be eigenvectors of 
operators 7r,(A) with eigenvalues a/, where A is an element of the Lie algebra of a 
symmetry group G. Then v\ ® vi is an eigenvector of (7Ti ® 7T2)(A) with eigenvalue 
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a i + «2 ■ 1 11 other words, the eigenvalue a of A i s an additive quantum number. Most 
familiar quantum numbers, such as energy, momentum and angular momentum, are 
additive quantum numbers, but there are exceptions. One such exception is the parity 
operator P. If we have two three-dimensional physical systems with corresponding 
Hilbert spaces Hi, then the H, should furnish representations n, of 0(3). Now, 
P — — I e 0(3) and if v, e Hi are eigenvectors of n, (P) with eigenvalues A;, then 
1>1 0 u 2 e 7-(i 0 H 2 has eigenvalue 

((rii 0 n 2 )(i 3 ))(ui 0 v 2 ) = ni (P)vi 0 n 2 (P)v 2 
= (A.iA. 2 )vi 0 v 2 , 

where we used the property (3.49) of the tensor product in the last line. Thus parity 
is known as a multiplicative quantum number. This is due to the fact that the parity 
operator P is an element of the symmetry group, whereas most other observables 
are elements of the symmetry algebra (usually the Lie algebra corresponding to the 
symmetry group). □ 

Our next order of business is to clarify what it means for an object to “transform 
like a tensor”. We already addressed this to a certain degree in Sect. 3.2 when we 
discussed change of bases. There, however, we looked at how a change of basis af- 
fects the component representation of tensors, which was the passive point of view. 
Here we will take the active point of view, where instead of changing bases we will 
be considering a group G acting on a vector space V via some representation n. 
Taking the active point of view should nonetheless give the same transformation 
laws, and we will indeed see below that by considering representations of G on ten- 
sor products of V and V* and examining their component form, we will reproduce 
the formulas from Sect. 3.2, just in the active form. 

Recall from Sect. 3.5 that the set of tensors of rank (r, s) on a vector space V is 
just 


T s r (V) = V*®---® V*0Y 0---0 V. (5.31) 

v '“ v— ' v '“ v ' 

r times s times 

Given a representation n of G on V, we would like to extend this representation to 
the vector space T S ' (V). To do this, we need to specify a representation of G on V*. 
This is easily done: V* is a vector space of functions on V (just the linear functions, 
in fact), so we can use (5.9) to obtain the dual representation (n*, V*), defined as 


(n*/)(u) = /( n g _n>), geG, feV*, veV. 


(5.32) 


This representation has the nice property that if {e, } and {e 1 } are dual bases for V 
and V*. then the bases { 11 (g)c j j } and j n*(g)e' } are also dual to each other for any 
geG. You will check this in Exercise 5.12 below. 

With the dual representation in hand, we can then consider the tensor product 
representation n£ = n* 0 • • ■ 0 n* 0 n 0 ■ ■ • 0 n, which is given by 


r times 


s times 
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(nj(g))(/i 0 ■ • ■ 0 f r 0 V\ <S> • • • 0 V s ) 

= n*(g)/i 0 ■ • • 0 n*(g)/ r < 8 ) n(g)pi ® ■ • • 0 n(g)iv 


(5.33) 


If G is a matrix Lie group, it is easy to check that the corresponding Lie algebra 
representation n r s is given by 


« (g)) (/l 0 • • • ® fr 0 Vl 0 • • • 0 V s ) 

= 7r*(g)/l 0 • ■ ■ 0 /r <8 HI 0 ■ ■ • <g> Vj 

+ f\ 0 JT*(g )/2 0 • • • 0 fr 0 Vl 0 ■ ■ ■ 0 v s H 

+ /i ® • • • ® n*(g)fr ® Vl ® • • • ® Vi 

+ /l ® • • • ® fr ® jr(g)vi ® • • • ® Vi 

+ /l ® • • • ® fr ® V! ® jr(g)v 2 0 • ■ ■ ® Vi H 

+ /l 0 - - • ® fr 0 v\ ® - • - ® 7r(g)Vj. (5.34) 


We will get a handle on these formulas by considering several examples. 

Exercise 5.12 Let (n, V) be a representation of some group G and (n*, V *) its dual rep- 
resentation. Show that if the bases {e/ } and [e'} are dual to each other, then so are {n(g)e, } 
and {n*(g)e'} for any g eG. 

Exercise 5.13 Alternatively, we could have defined n( by thinking of tensors as functions 
on V and using the idea behind (5.9) to get 


(nj(g)r)(vi,...,v r ,/i,...,/j) 

= L(n(g _ 1 )ti 1 , . . . , n(g _ 1 )tj r , n*(g- l )fu . . . , n ’(g” 1 )/,). 

Expand T in components and show that this is equivalent to the definition (5.33). 


Example 5.13 The dual representation 

Let us consider (5.33) with r — 1, s = 0, in which case the vector space at hand is 
just V* and our representation is just the dual representation (5.32). What does this 
representation look like in terms of matrices? For arbitrary f e V* , v e V we have 

(n*/)(v) = [n*/f[v] 

= ([n*][/]) r [v] 

= [/] r [n*] r [v] 

as well as 


(n*/)(v) = /(n g _tv) 

= [/][n g -i][v] 


which implies 


[ n g] r = [n g -i] = [n- 1 ] = try 1 . 
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(Why are those last two equalities true?) We thus obtain 




(5.35) 


In other words, the matrices representing G on the dual space are just the inverse 
transposes of the matrices representing G on the original vector space! This is just 
the active transformation version of (3.30). Accordingly, if (n, V ) is the fundamen- 
tal representation of 0(n) or SO(n), then the matrices of the dual representation are 
identical to those of the fundamental, which is just another expression of the fact 
that dual vectors transform just like regular vectors under orthogonal transforma- 
tions (passive or active). 

If G is a matrix Lie group then (II, V ) induces a representation (jr, V ) of g, and 
hence (II*, V*) should induce a representation (tt*, V*) of g as well. What does 
this representation look like? Well, by the definition of n*, we have (for any X e g, 
f eV*,veV) 


(it\X)f){v) = j t (n*(e' x )f)(v) 

= JA n ^»L 

= f(-7t{X)v). 

You will show below that in terms of matrices this means 


[7r*(X)] = -[7r(Y)] 7 '. 


(5.36) 


Note again that if jt is the fundamental representation of O(n) or SO(n ) then 
[tt(X)] = X is antisymmetric and so the dual representation is identical to the orig- 
inal representation. 


Exercise 5.14 Prove (5.36). This can be done a couple different ways, either by calculating 
the infinitesimal form of (5.35) (by letting g = e ,x and differentiating at t = 0) or by a 
computation analogous to the derivation of (5.35). 

Example 5.14 (II [ , C(V )): The linear operator representation 

Now consider (5.33) with (r, s) — (1, 1). Our vector space is then just Tj 1 = C(V), 
the space of linear operators on V ! Thus, any representation of a group G on a 
vector space V leads naturally to a representation of G on the space of operators 
on V. What does this representation look like? We know from (5.33) that G acts 
on Tj 1 by 

n!(g)(/<g>t0 = (n*/)®(rv), veV,feV*, geG. (5.37) 

This is not very enlightening, though. To interpret this, consider / ® v as a linear 
operator T on V, so that 


T(w) = f(w)v, weV. 
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Then (careful working your way through these equalities!) 

(n|(g)r)(w) = ((n*/)(iu))n g u 

= /( n^iw)n g u 

= n s (/(n r iiw)u) 
= n,(r(n- 1 ( U ;))) 
= (n,rn- 1 )(u;) 

so we have 


nj(g)r = n^rn” 1 . 


(5.38) 


It is easy to check that this computation also holds for an arbitrary T e C(V), since 
any such T can be written as a linear combination of terms of the form f ®v. Thus, 
(5.39) tell us that the tensor product representation of G on V* ® V — £(V) is just 
the original representation acting on operators by similarity transformations! This 
should not be too surprising, and you perhaps could have guessed that this is how 
the action of G on V would extend to C( V). Representing (5.38) by matrices yields 


[n}(g)r] = [n g ]m[n s r 1 (5.39) 

which is just the active version of (3.33). 

If G is a matrix Lie group we can also consider the induced Lie algebra rep 
(tTj , £(U)), which according to (5.27) acts by 

(W (*))(/ ® w) = (**(*)/) ® V + f ® 7t(X)v. (5.40) 

Since we saw in Chap. 4 that the adjoint representation of G induces the adjoint 
representation of g, we expect that tc\ should act by the commutator. This is in fact 
the case, and you will show below that 


Tt\(X)T = [n(X),T\ 


(5.41) 


This, of course, reduces to a commutator of matrices when a basis is chosen. □ 


Exercise 5.15 Prove (5.41). As in Exercise 5.14, this can be done by either computing the 
infinitesimal form of (5.38), or performing a calculation similar to the one above (5.38), 
starting with the Lie algebra representation associated to (5.37). 

It should come as no surprise that the examples above reproduced the formulas 
from Sect. 3.2, and in fact it is easy to show that the general tensor product repre- 
sentation (5.33) is just the active version of our tensor transformation law (3.22). 
Using the fact that 


(n;)/ = (□,-, >7, 

which you will prove below, we have for an arbitrary (r. s) tensor 7', 


(5.42) 
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nj(g) 7 ’ = n r s (.g)(Ti 1 .j r il - j ‘e 11 ®e h ® <8>e js ) 

= T i: .j r j ' n* g e h ® ■ ■ • ® n*e' v ® n g e h <g> • ■ • <g> n 4 ,e /s 
= ' • ' ( n g 1 ) / t r !r (n g ) 7 - 1 /l • ■ • (n g )j s ls T h „. ir h - j *e k ' ®---®e kr 

®ei l ®---®ei s (5.43) 

so that, relabeling dummy indices, we have 

= (n- 1 )^ 1 • • ■ (n- 1 ) I >(n g ),/> ■ ■ • (n g )^T h ... kr 1 ^ (5.44) 

which is just the active version of (3.22), with 11(g) replacing A and 11(g) -1 re- 
placing A~ l . 

Exercise 5.16 Verify (5.42). 


5.4 Symmetric and Antisymmetric Tensor Product 
Representations 

With the tensor product representation now in place, we can now consider symmet- 
ric and antisymmetric tensor product representations. Note that for tensors of type 
(r, 0) or (0, r) the tensor product representation (5.33) is symmetric, in the sense 
that all the factors in the tensor product are treated equally. A moment’s thought 
then shows that if we have a completely symmetric tensor T e S' (V). then n ° (g ) T 
is also in S r (V). The same is true for the completely antisymmetric tensors A'P, 
and for the spaces S r (V*) and A r V*. Thus these subspaces of T®(V) and Tq (V) 
furnish representations of G in their own right, and we will see below that many of 
these representations are familiar. 

Example 5.15 S'( C 2 ) 

Consider S 1 (C 2 ), the completely symmetric (0, /) tensors on C 2 . By way of exam- 
ple, when / = 3 a basis for this space is given by 

vq = e\ <g> e\ ® e\ 

ui = e2 <S> e\ <g> e\ + e\ <g> ei <S> e\ + e\ <g> e\ <g> e-2 
H2 = e2 ® ez <S> e\ + e\ <g> ei ® e2 + e2 <S> e\ <g> e2 
V 3 = e2 ® e2 ® ej 

and in general we have 

vq = e\ ® e\ <g> • • • <g> e\ 

Hi e\ <g> • • • <g> ei + permutations 

H 2 = H 2 <S> e 2 ® ei ® ® ei + permutations 
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vi-\ = ej <8> ei <8> ■ ■ ■ <8> e\ + permutations 


vi =e 2 ® e 2 ® ■ ■ ■ ® e 2 - 

The tensor product representation (TI^, 7j°(C 2 )) restricts to S 1 ( C 2 ) C 7j°(C 2 ), 
yielding a representation of SU( 2) which we will denote as (5 ,/ n, S 1 (C 2 )). One 
can easily compute the corresponding su(2) representation, denoted (S'tt, S 1 (C 2 )), 
and it is easy to show that in the basis given above, 


[(S l 7t)(S i )]=i 


( 1/2 


\ 


1 / 2 - 1 


- 1 / 2 ) 


(5.45) 


which is the same as (5.11). This suggests that S l (C 2 ) is the same representation 
as T)(C 2 ), and is thus also the same as the spin 1/2 representation of su(2). We 
will soon see that this is indeed the case. Note that we have a correspondence here 
between symmetric (0, Z) tensors on a vector space and degree / polynomials on a 
vector space, just as we did in Example 3.24. 


Exercise 5.17 Verify (5.45). 


Example 5.16 Antisymmetric second rank tensors and the adjoint representation of 
0(n) 


Consider the fundamental representation (IT, R") of O(n). We can restrict the tensor 
product representation (IT®, R" ®R") to the subspace A 2 R" to get the antisymmet- 
ric tensor product representation of 0(n) on A 2 R", which we will denote as A 2 n. 
Let X — X l i ei <g> ej e A 2 R" with the standard basis. Then by (5.33), we have (for 
ReO(n)) 


A 2 Yl(R)(X) - X" Re, <g> Re j = X> ' R kiRljek ® e. 


which in terms of matrices reads 


k,l 


(5.46) 


[A 2 n(/?)X] = ^[Z]/? 7 ’ = ^[X]f?“ 1 . (5.47) 

So far we have just produced the active transformation law for a (0, 2) tensor, and we 
have not yet made use of the fact that X is antisymmmetric. Taking the antisymmetry 
of X into account, however, means that [X] is an antisymmetric matrix, and (5.47) 
tells us that it transforms under 0(n) by similarity transformations. This, however, 
is an exact description of the adjoint representation of O(n) ! So we conclude that 
the adjoint representation of 0(n ) (and hence of SO(n) and so («)) are the same as 
the tensor product representation (A 2 n, A 2 R"). □ 


Example 5.17 Antisymmetric tensor representations of 0(3) 

Consider the antisymmetric tensor representations (A* IT, A*R 3 ), k — 1, . . . , 3 of 
0(3), obtained by restricting (fl^, 7j.°(R 3 )) to A a 'R 3 C 7j°(R 3 ). For convenience 
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we will define A°R 3 to be the trivial representation on R (also known as the scalar 
representation). Now, we already know that A 1 ®! 3 = R 3 is the fundamental repre- 
sentation, and from the previous example we know that A 2 R 3 is the adjoint repre- 
sentation, also known as the pseudovector representation. What about A 3 R 3 ? We 
know that this vector space is one-dimensional (why?), so is it just the trivial repre- 
sentation? Not quite. Taking the Levi-Civita tensor e\ A e 2 A e 2 as our basis vector, 
we have 


(A 3 n(7?))(ei A ej A e 3 ) = (Re 1 ) A (Re 2 ) A (Re 3 ) 

= |/?|ei A ez A e 2 by (3.105) 

= ±ei A e 2 A e 2 . 

Thus e\ A e 2 A e 2 is invariant under rotations but is still not a scalar, since it changes 
sign under inversion. An object that transforms this way is known as a pseudoscalar, 
and A 3 R 3 is thus known as the pseudoscalar representation of 0(3). Note that if we 
restrict to .SO(3), then \R\ — 1 and A 3 R 3 is then just the scalar (trivial) representa- 
tion, just as A 2 R 3 is just the vector representation of SO( 3). Also, as we mentioned 
before, if we restrict our representations to Z 2 ~ {/, — 1} C 0(3) then they reduce 
to either the trivial or the alternating representation. We summarize all this in the 
table below. 


V 

0(3) 

50(3) 

{/,-/} ~Z 2 

A°R 3 

scalar 

scalar 

trivial 

a'r 3 

vector 

vector 

alt 

a 2 r 3 

pseudovector 

vector 

trivial 

a 3 r 3 

pseudoscalar 

scalar 

alt 


Example 5.18 Antisymmetric tensor representations of 0(3, 1) 

Here we repeat the analysis from the previous example but in the case of 0(3, 1). 
As above, A°R 4 = R is the trivial (scalar) representation. A 1 ®! 4 = R 4 is the vector 
representation, and A 2 R 4 is just known as the second rank antisymmetric tensor 
representation (or sometimes just tensor representation). How about A 3 R 4 ? To get 
a handle on that, we will compute matrix representations for the corresponding Lie 
algebra representation, using the following basis B for A 3 R 4 : 

fl = e 2 A es A e 4 
f 2 = —e\ A Q A f 4 
h =e\/\ei/\ e 4 

U = <T A e 2 A e 3 . 

You will check below that in this basis, the operators (A 3 jr)(Z;) and (A 3 7r) (K,) 
are given by 
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[(A 3 ; r)(L i )] B = L i (5.48) 

[(A \){K,)] B = Ki (5.49) 

so (A 3 7T, A 3 R 4 ) is the same as the fundamental (vector) representation of so(3, 1)! 
When we consider the group representation A 3 n, though, there is a slight difference 
between A 3 R 4 and the vector representation; in the vector representation, the parity 
operator takes the usual form 


/-I 0 0 

n < p > = p = 0 “o' -1 

v 0 0 0 

whereas on A 3 R 4 , we have (as you will check below) 

[A 3 n (P)] B = 


o\ 

0 

0 

1 / 


/I 

0 

0 

0 ^ 

0 

1 

0 

0 

0 

0 

1 

0 


0 

0 

-v 


(5.50) 


(5.51) 


which is equal to — P. Thus the elements of A 3 R 4 transform like 4-vectors under 
infinitesimal Lorentz transformations (and, as we will show, under proper Lorentz 
transformations), but they transform with the wrong sign under parity, and so in 
analogy to the three-dimensional Euclidean case, A 3 R 4 is known as the pseudovec- 
tor representation of <3(3, 1). As in the Euclidean case, if one restricts to 5(9(3, 1) 0 
then parity is excluded and then A 3 R 4 and R 4 are identical, but only as representa- 
tions of the proper Lorentz group 5(9(3, I )„. 

The next representation, A 4 R 4 , is one-dimensional but, as in the previous ex- 
ample, is not quite the trivial representation. As above, we compute the action of 
A 4 n(A), A e (9(3, 1) on the Levi-Civita tensor e\ A ej A e^, A e\\ 


(A 4 n(A))(ei A e 2 A A 64 ) = ( Ae \ ) A (Ac 2 ) A (A<? 3 ) A (Ae 4 ) 

= I A I Cl A C 2 A C 3 A C4 
= ±ei A C 2 A 63 A «4 

and so ci A e 2 A e 3 A e 4 is invariant under proper Lorentz transformations but changes 
sign under improper Lorentz transformations. As in the Euclidean case, such an ob- 
ject is known as a pseudoscalar, and so (A 4 n, A 4 R 4 ) is known as the pseudoscalar 
representation of (9(3, 1). 

As in the previous example, we can use { I, P } ~ Z 2 to distinguish between A 3 R 4 
and R 4 , as well as between A 4 R 4 and the trivial representation. The operators n(.P) 
and A 3 n(.P) can be distinguished in a basis-independent way by noting that Y\( P) 
is diagonalizable with eigenvalues {— 1 , — 1 , — 1 , 1}, whereas A 3 n (P ) has eigenval- 
ues { 1 , 1 , 1 , — 1 } . We again summarize in a table, where in the last column we write 
the eigenvalues of A*n(P) in those cases where A 4 n ( T ) ^ ±7: 
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V 

0(3, 1) 

50(3, 1) 0 

{I.P}-%2 

A°R 4 

scalar 

scalar 

trivial 

A'R 4 

vector 

vector 

{-1,-1, -1,1) 

a 2 r 4 

tensor 

tensor 

{1, 1. 1,-1. -1,-1) 

a 3 r 4 

pseudovector 

vector 

{1,1,1,-!} 

a 4 r 4 

pseudoscalar 

scalar 

alt 


Exercise 5.18 By explicit computation, verify (5.48), (5.49), and (5.51). Also, show that 
A 2 n(P) has eigenvalues {—1, —1, —1, 1, 1, 1}. You may need to choose a basis for A 2 R 4 
to do this. 


5.5 Equivalence of Representations 


In the previous section we noted that the vector and dual vector representations of 
Oin) (and hence SO(n) and so(«)) were 'the same’, as were the adjoint represen- 
tation of 0(n ) and the antisymmetric tensor product representation (A 2 n, A 2 R"). 
We had also noted a few equivalences in Sect. 5.1, where we pointed out that the ad- 
joint representation of SO(3) was, in a certain matrix representation, identical to the 
vector representation of SO(3), and where we claimed that the adjoint representa- 
tion of so (3, 1) is equivalent to the antisymmetric second rank tensor representation. 
However, we never made precise what we meant when we said that two representa- 
tions were ‘the same’ or ‘equivalent’, and in the cases where we attempted to prove 
such a claim, we usually just showed that the matrices of two representations were 
identical when particular bases were chosen. Defining equivalence in such a way is 
adequate but somewhat undesirable, as it requires a choice of basis; we would like 
an alternative definition that is more intrinsic and conceptual and that does not re- 
quire a choice of coordinates. The desired definition is as follows: Suppose we have 
two representations (Hi , Vi) and ( YI 2 , Vj) of a group G. A linear map 0 : Vj — > Vj 
which satisfies 


n 2 (g)(<t>(v)) = ct>(n l (g)v) Vu e Vj, g e G 


(5.52) 


is said to be an intertwining map or intertwiner. This just means that the action 
of G via the representations commutes with the action of 0. If in addition 0 is a 
vector space isomorphism, then (II 1 , V\) and (ID, Vj) are said to be equivalent. 
Occasionally we will denote equivalence by (ni, V \ ) ~ (IT 2 , Vj). Equivalence of 
Lie algebra representations and their corresponding intertwining maps are defined 
similarly, by the equation 


7T 2 (X)((p(v)) = (P(tti(X)v) WveVuXeg. (5.53) 

Another way to write (5.52) is as an equality between maps that go from Vj to Vj: 


ri2(g) °0 = 0o Ili(g) VgeG 


(5.54) 
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and likewise for Lie algebra representations. When tp is an isomorphism, this can 
be interpreted as saying that n i (g) and n 2 (g) are the ‘same’ map, once we use the 
intertwiner cp to identify V\ and V?. Another way to interpret this is to choose bases 
for Vi and V 2 and then write (5.54) as 

[n 2 (g)][0] = W>][n!(g)] wgeG 
or 

[n 2 fe)] = [0][n 1 (g)][0]- 1 VgeG 

which says that the matrices [n 2 (g)] and [n j (g)] are related by a similarity trans- 
formation. You will use this below to show that our definition of equivalence of 
representations is equivalent to the statement that there exists bases for V\ and V 2 
such that [Lli (g)] = [n 2 (g)] Vg e G (or the analogous statement for Lie algebras). 

Exercise 5.19 Show that two representations f n i , Vi) and (LL, VS) are equivalent if and 
only if there exist bases Si C Vi and C VS such that 

[ni(g)] Bl = [ n 2(g)] B2 Vg e G. (5.55) 

Exercise 5.20 Let (11,- , V) ). / = 1 , 2 be two equivalent representations of a group G , and let 
H C G be a subgroup. Prove that restricting fl, : G — > GL(Vi) to maps LI; : H — >■ GL(Vi) 
yield representations of H , and that these representations of H are also equivalent. Thus, 
for example, equivalent representations of 0(n) yield equivalent representations of SO(n), 
as one would expect. 

Before we get to some examples, there are some immediate questions that arise. 
For instance, do equivalent representations of a matrix Lie group G give rise to 
equivalent representations of g, and conversely, do equivalent representations of g 
come from equivalent representations of G? As to the first question, on philosoph- 
ical grounds alone we would expect that since g consists of ‘infinitesimal’ group 
elements, equivalent group representations should yield equivalent Lie algebra rep- 
resentations. This is in fact the case: 

Proposition 5.1 Let G be a matrix Lie group, and let (LI, , V,), i = 1,2 be two 
equivalent representations of G with intertwining map <p. Then (p is also an inter- 
twiner for the induced Lie algebra representations (jr Vi), and so the induced Lie 
algebra representations are equivalent as well. 

Proof We proceed by direct calculation. Since (p is an intertwiner between (FI i , V \ ) 
and (n 2 , V 2 ) we have 

U 2 (e ,x )((p(v)) = <p(ni(e ,x )v) VneFi, A e g, t e R. 

Taking the t derivative of the above equation, evaluating at t = 0, and using the 
definition of the induced Lie algebra representations tt, , as well as the fact that (p 
commutes with derivatives (cf. Exercise 4.34), we obtain 

n 2 {X)((p(vj) =(P(tt\{X)v) Vreki, Xeg 
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and so tt\ and tti are equivalent (you should explicitly confirm this as an exercise if 


□ 


more detail is needed). 


The second question, of whether or not equivalent representations of g come 
from equivalent representations of G, is a bit trickier. After all, we noted in the in- 
troduction to this chapter (and will see very concretely in the case of so ( 3 )) that not 
every representation of g necessarily comes from a representation of G. However, 
if we know that two equivalent Lie algebra representations (7 17, V;), i = 1,2 actu- 
ally do come from two group representations (n,-, Vi), and we know the group is 
connected, then it is true that T1 1 and YI2 are equivalent. 

Proposition 5.2 Let G be a connected Lie group and let (n, , V;), i = 1,2 be two 
representations of G, with associated Lie algebra representations ( m , Vi). If the 
Lie algebra representations (jti, Vi) are equivalent, then so are the group represen- 
tations (n,-, Vi) from which they came. 

Proof The argument relies on the following fact, which we will not prove (see Hall 
[8] for details): if G is a connected matrix Lie group, then any g e G can be written 
as a product of exponentials. That is, for any g e G there exist X, e g, f, e K, 
i = \, ... ,n such that 


g = e , ' x 'e t2Xl ---e tnXn 


(5.56) 


(In fact, for all the connected matrix Lie groups we have met besides SL{ 2 , C), 
every group element can be written as a single exponential. For SL( 2, C), the polar 
decomposition theorem [see Problem 4 . 6 ] guarantees that any group element can 
be written as a product of two exponentials.) With this fact in hand we can show 
that ni and TI2 are equivalent. Let (p be an intertwining map between (tti, Vi) and 
( 7 x 2 , V 2 ). Then for any g e G and v e V\ we have 

n2(g)(0(iO) 

= n 2 (e ,lXl e ,2X2 ---e , ” x ")((f>(v)) 

= Yl2(e ,lXl )n2(e ,2X2 ) ■■■ TI2 (e'" Xn )(cp{v)) since n2 is a homomorphism 
= (e ,,K 2 <X] le t27T2<X2 ^ ■ ■ ■ e ,n 7 T 2 <Xn> ')((p(v)j by definition of 1x2 
= e t 2 n V x 2 ) . . . gtnXiWn) v'j by Exercise 5.21 below 

= 0 (ni (e riX| )ni (e^ 2 ) • • • n l {e tnXn )v) by definition of jti 
= </>(ni (e t,x, e tlX2 ■ ■ ■ e ,nX "^v) since ni is a homomorphism 
= 0(niCg)u) 

and so n 1 and n2 are equivalent. □ 

Proposition 5.2 is useful in that it allows us to prove equivalence of group rep- 
resentations by examining the associated Lie algebra representations, which are (by 
virtue of linearity) often easier to work with. 
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Exercise 5.21 Let n\ and Hi be two equivalent representations of a Lie algebra g with 
intertwining map <f>. Prove by expanding the exponential in a power series that 

e «2(X) o0 = </>o VIefl. 

Exercise 5.22 We claimed above that when a matrix Lie group G is connected, then any 
group element can be written as a product of exponentials as in (5.56). To see why the 
hypothesis of connectedness is important, consider the disconnected matrix Lie group 0(3) 
and find an element of 0(3) that cannot be written as a product of exponentials. 

Now it is time for some examples. 

Example 5.19 Equivalence of so(3) and R 3 as .S' (9(3) representations 


We know that the adjoint representation and the fundamental representation of so (3) 
are equivalent, by Exercise 5.19 and the fact that in the [L,}, = i _3 basis we have 
[ad/,, ] = Lj . What, then, is the intertwining map between R 3 and so(3)? Simply the 
map 

f : so(3) — » R 3 

/ 0 -z y \ 

I z 0 -i (x, y, z ) 

\-y x o / 

which is just the map X i-»- [Y]g, where B — {L, } != 1 - 3 . To verify that this is an in- 
tertwiner we will actually work on the Lie algebra level, and then use Proposition 5.2 
to conclude that the representations are equivalent on the group level. To verify that 
4> satisfies (5.53), one need only prove that the equation holds for an arbitrary basis 
element of so(3); since the 7 r, are linear maps, we can expand any X e so(3) in 
terms of our basis and the calculation will reduce to verifying the equality just for 
the basis elements. (This is the advantage of working with Lie algebras; facts about 
representations [such as equivalence] are usually much easier to establish directly 
for Lie algebras than for the corresponding groups, since we can use linearity.) We 
thus calculate, for any Y e so(3), 


( 0 oadz. ; )(F) = 0 (adL,.T) 
= [ad/„ Y] 

= [ad/.,][K] 
= L t [Y] 


while 


{Li o <j>)(Y) — Lj[Y], 

Thus (f> is an intertwining map and the fundamental and adjoint representations of 
50(3) are equivalent. 

By Proposition 5.2, we can then conclude that the adjoint and fundamental repre- 
sentations of 50(3) are equivalent, since 50(3) is connected. What about the adjoint 
and fundamental representations of O (3) ? You will recall that O (3) is not connected 
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(and in fact has two separate connected components), so we cannot conclude that its 
adjoint and fundamental representations are equivalent. In fact, as we mentioned be- 
fore, we know that these representations are not equivalent, since if (f> : so(3) — »■ M 3 
were an intertwining map, we would have for any Y e so(3), 

(f>(Ad-iY) = 4>{Y) since Ad_/ is the identity 

as well as 

< />(Ad_/7 ) = (— I)(p(Y ) since 0 is an intertwiner 

= ~<P(Y), 

a contradiction. Thus <p cannot exist, and the fundamental and adjoint representa- 
tions of 0(3) are inequivalent. 

Exercise 5.23 Use Example 5.17 and Exercise 5.19 to deduce that R and A 3 R 3 are equiva- 
lent as 50(3) representations (in fact, they are both the trivial representation). Can you find 
an intertwiner? Do the same for the 50(3, 1) 0 representations R and A 4 R 4 , using the results 
of Example 5.18. Using Proposition 5.2, also conclude that R 4 and A 3 R 4 are equivalent as 
S0(3, 1 ) 0 representations. 

Exercise 5.24 Reread Example 3.29 in light of the last few sections. What would we now 
call the map J that we introduced in that example? 

Example 5.20 Vector spaces with metrics and their duals 

Let V be a vector space equipped with a metric g (recall that a metric is any symmet- 
ric, non-degenerate bilinear form), and let (n, V) be a representation of G whereby 
G acts by isometries, i.e. 11 (g) e Isom(U) Vg e G. Examples of this include the 
fundamental representation of 0(n ) on M" equipped with the Euclidean metric, or 
the fundamental representation of 0(n — 1, 1) on M" with the Minkowski metric, 
but not U (n) or SU(n) acting on C" with the standard Hermitian inner product (why 
not?). If the assumptions above are satisfied, then (n, V ) is equivalent to the dual 
representation (n*, V*) and the intertwiner is nothing but our old friend 

L \V V* 
v h* g(v, •)■ 

You will verify in Exercise 5.25 that L is indeed an intertwiner, proving the asserted 
equivalence. Thus, in particular we again reproduce (this time in a basis-independent 
way) the familiar fact that dual vectors on n -dimensional Euclidean space transform 
just like ordinary vectors under orthogonal transformations. Additionally, we see 
that dual vectors on n -dimensional Minkowski space transform just like ordinary 
vectors under Lorentz transformations! See Exercise 5.26 for the matrix manifesta- 
tion of this. 

The reason we have excluded complex vector spaces with Hermitian inner prod- 
ucts from this example is that in such circumstances, the map L is not linear (why 
not?) and thus cannot be an intertwiner. In fact, the fundamental representation of 
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SU(n ) on C" for n > 3 is not equivalent to its dual, 9 and the dual representation in 
such circumstances can sometimes be interpreted as an antiparticle if the original 
representation represents a particle. For instance, a quark can be thought of as a 
vector in the fundamental representation C 3 of 5(7(3) (this is sometimes denoted as 
3 in the physics literature), and then the dual representation C 3 * corresponds to an 
antiquark (this representation is often denoted as 3). For 5(7(2), however, the funda- 
mental (spinor) representation is equivalent to its dual, though that does not follow 
from the discussion above. This equivalence is the subject of the next example. 

Exercise 5.25 Let V be a vector space equipped with a metric g and let (FI, V ) be a 
representation of a group G by isometries. Consider L : V — > V* as defined above. Prove 
that toIlg = If* o L for all g e G. Since (L o Flg)(i>) e V* for any v e V, you must show 
that 


(tongKt) = (n;oi)(») 


as dual vectors on V, which means showing that they have the same action on an arbitrary 
second vector w . 

Exercise 5.26 Let (n, R 4 ) be the fundamental representation of 0(3, 1) on R 4 , and let 
B = {£/}[■— 1—4 be the standard basis for R 4 and B* = {e'}i= 1-4 the corresponding dual 
basis. Find another basis B* for R 4 * such that 


[n*( g )] s „=[n ( g)] 8 . 


The map L might help you here. Does this generalize to the fundamental representation of 
0(n — 1, 1) on n -dimensional Minkowski space? 

Example 5.21 The fundamental (spinor) representation of 5(7(2) and its dual 

Consider the fundamental representation (FI, C 2 ) of 5(7(2) and its dual represen- 
tation (F1*,C 2 *). We will show that these representations are equivalent, by first 
showing that the induced Lie algebra representations are equivalent and then invok- 
ing Proposition 5.2. We will show equivalence of the Lie algebra representations by 
exhibiting a basis for C 2 * that yields the same matrix representations for (n* , C 2 *) 
as for (it, C 2 ). For an intrinsic (coordinate-independent) proof, see Problem 5.2. 

As should be familiar by now, the fundamental representation of su(2) is just the 
identity map: 



This combined with (5.36) tells us that, in the standard dual basis B*, 


9 We will not prove this here; see Hall [8] for details. 
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Now define a new basis B* — {e l , e 2 } for C 2 * 




You can check that the corresponding change of basis matrix A is 


A = 



(5.57) 


and that in this basis the operators 7t*(Sj) are given by 

|>r = A[7T*(S x )] Bt A~' = S x 

[7r*(5 v )]^ = A[jt*(S y )] Br A~' = S y (5.58) 

[tt*(S,)] b *, = A[n*{S z )\ B *A- 1 = S z 

and thus (jt* , C 2 *) is equivalent to (jt, C 2 ). The connectedness of SU( 2) and Propo- 
sition 5.2 then imply that (n*, C 2 *) is equivalent to (PI, C 2 ), as claimed. 


Exercise 5.27 Verify (5.57) and (5.58). 

Exercise 5.28 Extend the above argument to show that the fundamental representation of 
SL( 2, C) is equivalent to its dual. 


Example 5.22 Antisymmetric second rank tensors and the adjoint representation of 
orthogonal and Lorentz groups 


We claimed in Sect. 5.1 that for the Lorentz group, the adjoint representation is 
‘the same’ as the antisymmetric second rank tensor representation (the latter being 
the representation to which the electromagnetic field tensor F ,JV belongs). We also 
argued at the end of Sect. 5.3 that the same is true for the orthogonal group 0(n). 
Now it is time to precisely and rigorously prove these claims. Consider M" equipped 
with a metric g, where g is either the Euclidean metric or the Minkowski metric. The 
isometry group G — lsom( V) is then either 0(n) or 0(n — 1 , 1 ), and g is then either 
so(n) or so(n — 1,1). To prove equivalence, we need an intertwiner </> : A 2 R" — > g. 
Here we will define (p abstractly and use coordinate-free language to prove that it is 
an intertwiner. The proof is a little long, requires some patience, and may be skipped 
on a first reading, but it is a good exercise for getting acquainted with the machinery 
of this chapter; for an illuminating coordinate proof, see Problem 5.1. 

To define (j> abstractly we interpret g not as a space of matrices but rather as linear 
operators on R". Since A 2 R" is just the set of antisymmetric (0, 2) tensors on JR", 
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we can then define <p by just using the map L to Tower an index’ on an element 
T e A 2 R", converting the (0, 2) tensor into a (1,1) tensor, i.e. a linear operator. 
More precisely, we define the linear operator <p(T) as 

<KT)(v, f) = T(L(v), /), v e R", / e R"*. 

We must check, though, that <p(T) e g. Earlier, we had characterized g by a matrix 
condition like antisymmetry or the condition (4.70). Now that we are thinking of g as 
a space of linear operators, however, we need another characterization. Fortunately, 
we have already discussed this; as in (5.1), we can write 

g[e tX v, e ,x w) = g( v, w ) Vi>, w eV, Xeg, f eR 

and then differentiate at t — 0 to obtain 

g(Xv, w) + g(v, Xw) = 0, (5.59) 

an alternative definition of g which you can easily show is equivalent to the matrix 
conditions we used originally (in fact, just writing out (5.59) in matrices leads to the 
antisymmetry condition in the Euclidean case, and (4.70) in the Lorentzian case). 
Thus, to show that the range of (p really is g, we just need to show that (p(T) as 
defined above satisfies (5.59): 

g(<P(T)v, w ) +g(v, <p{T)w) 

= <p(T)(v , L(w)) + </>(T)(w, L(v)) by definition of L 
= T (L(v), L(w )) + T (L(w), L( i>)) by definition of 4>{T) 

= 0 by antisymmetry of T. 

Thus <p(T) really is in g. To show that (j> is an intertwiner, we need to show that 
(p o A 2 n(7?) = Adi K) o <p for all R e G. To do this, we will employ the alternate 
definition of the tensor product representation given in Exercise 5.13, which in this 
case says 

(A 2 n(R)T)(f,h)=T(n*(R- l )f,n*(R~ l )h), ReG , f,heR n *. (5.60) 

We then have, for all v € R", / e R"* (careful with all the parentheses!), 

((0 o A 2 n(/?))(T))( V , /) = (A 2 n(tf)cr))(L(p), /) 

= T(n*(R~ 1 )L(v), n *(R~ l )f) 

= T(L(R- l v),n*(R~ l )f) 

where in the last equality we used the fact that L is an intertwiner between R" 
and R"*. We also have 

((Ad(tf) O 4>)(T))(v, /) = <P(T)(R~ 1 V, n *(R~ 1 )f) 

= T(L(R- l v),n*{R~ l )f) 

and we can thus conclude that (p o (A 2 fl (/?)) = Ad (R) o <p , as desired. 
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So what does all this tell us? The conclusion that the tensor product representa- 
tion of G on antisymmetric second rank tensors coincides with the adjoint represen- 
tation of G on g is not at all surprising in the Euclidean case, because there g is just 
so(n), the set of all antisymmetric matrices! In the Lorentzian case, however, we 
might be a little surprised, since the matrices in so(n — 1,1) are not all antisymmet- 
ric. These matrices, however, represent linear operators, and if we use L to convert 
them into (0, 2) tensors (via </> _1 ) then they are antisymmetric! In other words, we 
can always think of the Lie algebra of a group of metric-preserving operators 
as being antisymmetric tensors, though we may have to raise or lower an index 
(via L ) to make this manifest. 

This is actually quite easy to show in coordinates; using the standard basis for R" 
and the corresponding components of X — Xj J e‘ <E) e j eg (still viewed as a linear 
operator on M"), we can plug two basis vectors into (5.59) and obtain 

0 = g(Xe-i,ej) + g(ei, Xej) 

= Xj k g(e k , e j) + Xj k g(ei,e k ) 

= Xi k g kj + X j k gi k 
= Xij + Xji 

and so the (2, 0) tensor corresponding to X e g is antisymmetric! □ 

Our last example takes the form of an exercise, in which you will show that 
some of the representations of SU{2) on P/(C 2 ), as described in Example 5.7, are 
equivalent to more basic SU( 2) representations. 

Exercise 5.29 Prove (by exhibiting an intertwining map) that (,T| , Pi(C 2 )) from Exam- 
ple 5.7 is equivalent to the fundamental representation of su(2). Conclude (since SU(2) is 
connected) that ( n i , Pi(C 2 )) is equivalent to the fundamental representation of SU( 2). Do 
the same for (jT2, P2OC)) and the adjoint representation of su(2) (you will need to consider 
a new basis that consists of complex linear combinations of the elements of 82)- 


5.6 Direct Sums and Irreducibility 

One of our goals in this chapter is to organize the various representations we have 
met into a coherent scheme, and to see how they are all related. Defining a notion of 
equivalence was the first step, so that we would know when two representations are 
‘the same’. With that in place, we would now like to determine all the (inequivalent) 
representations of a given group or Lie algebra. In general this is a difficult problem, 
but for most of the matrix Lie groups we have met so far and their associated Lie 
algebras, there is a very nice way to do this: for each group or algebra, there exists a 
denumerable set of inequivalent representations (known as the “irreducible” repre- 
sentations) out of which all other representations can be built. Once these irreducible 
representations are known, any other representation can be broken down into a kind 
of “sum” of its irreducible components. In this section we will present the notions 
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of irreducibility and sum of vector spaces, and in subsequent sections we will enu- 
merate all the irreducible representations of SU(2), SO( 3), SO( 3, 1), 57.(2, C) and 
their associated Lie algebras. 

To motivate the discussion, consider the vector space M n (R) of all n x n real ma- 
trices. There is a representation n of O(n ) on this vector space given by similarity 
transformations: 


FI(7?)A = 7;AI? _1 , R e 0(n), Ae M n Q R). 

If we consider M„(R) to be the matrices corresponding to elements of £(R"), then 
this is just the matrix version of the linear operator representation II [ described 
above, with V — R" . Alternatively, it can be viewed as the matrix version of the 
representation IT, on R" ® R" , with identical matrix transformation law 

[n?(/?)(r)] = = R[T]R~\ T eR" ®R". 

Now, it turns out that there are some special properties that A e M„ (R) could have 
that would be preserved by 11(7?). For instance, if A is symmetric or antisymmetric, 
then RAR~ l is also; you can check this directly, or see it as a corollary of the discus- 
sion above Example 5.16. Furthermore, if A has zero trace, then so does RAR~ { . 
In fact, we can decompose A into a symmetric piece and an antisymmetric piece, 
and then further decompose the symmetric piece into a traceless piece and a piece 
proportional to the identity, as follows: 

A= X -(A + A T )+ l -(A-A T ) (5.61) 

= -(Tr A)7 + Ua + A t - 2 (Tr A)/) + i(A - A T ). (5.62) 
n 2 \ n J 2 

(You should check explicitly that the first term in (5.62) is proportional to the iden- 
tity, the second is symmetric and traceless, and the third is antisymmetric.) Fur- 
thermore, this decomposition is unique, as you will show below. If we recall the 
definitions 


S n (R) = {Me M„(R) | M = M r ) 

A„(R) = {Me M„(R) | M = -M r {, 
and add the new definitions 

S' n (R) = {Me M„ (R) | M = M T , TrM = 0} 

R7 = {M e M„(R) | M = cl, ceR}, 

then this means that any A e M n (R) can be written uniquely as a sum of elements 
of S' n , A„ and R7, all of which are subspaces of M„(R). This type of situation 
turns up frequently, so we formalize it with the following definition: If V is a vector 
space with subspaces Wi , Wo, . . . , HR such that any v e V can be written uniquely 

as v = w i + v )2 H b Wk, where w, e Wj , then we say that V is the direct sum of 

the Wj and we write 
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V = Wi ® W 2 ® ■ • ■ ® W k 

which is sometimes abbreviated as V = @f =1 VV, . The decomposition of a vector 
i> is sometimes written as v = (w i, . . . , Wk). In the situation above we have, as you 
can check, 


M„ (M) = S„ (K) © A n (M) = RI © S' n (K) © A n (K) . (5 .63) 

Exercise 5.30 Show that if V = ©* =1 Wi, then Wi n Wj = {0} Vi ^ j, i.e. the intersection 
of two different W, is just the zero vector. Verify this explicitly in the case of the two 
decompositions in (5.63). 

Exercise 5.31 Show that V = ©f =1 Wi is equivalent to the statement that the set 


where each S; is an arbitrary basis for Wi, is a basis for V. 

Exercise 5.32 Show that M n (C) = H n (C) © u(«). 

Our discussion above shows that all the subspaces that appear in (5.63) are in- 
variant, in the sense that the action of 11(A) on any element of one of the subspaces 
produces another element of the same subspace (i.e. if A is symmetric then V\(R)A 
is also, etc.). In fact, we can define an invariant subspace of a group representa- 
tion (n, V) as a subspace W c V such that fl(g)w e W for all g e G, w e W . 
Invariant subspaces of Lie algebra representations are defined analogously. Notice 
that the entire vector space V, as well as the zero vector {0}, are always (trivially) 
invariant subspaces. An invariant subspace W C V that is neither equal to V nor to 
{0} is said to be a non-trivial invariant subspace. Notice also that the invariance of 
W under the 11(g) means we can restrict each 11(g) to W (that is, interpret each 
11(g) as an operator Id (g)| w e GL(W )) and so obtain a representation ( 11 1 u- . W) 
of G on IT. Given a representation V, the symmetric and antisymmetric subspaces 
S r (V) and A r (V ) of the tensor product representation 7)° ( V ) are nice examples 
of non-trivial invariant subspaces (recall, of course, that A r (V) is only non-trivial 
when r < dim V). 

Non-trivial invariant subspaces are very important in representation theory, as 
they allow us to block diagonalize the matrices corresponding to our operators. If 
we have a representation (II, V) of a group G where V decomposes into a direct 
sum of invariant subspaces W , , 


V = Wi © W 2 © • • • © Wk, 


and if B , are bases for Wj and we take the union of the B, as a basis for V, then the 
matrix representation of the operator II (g) in this basis will look like 


/[n (g)hi 


[n(s)] B = 


V 


[n 2 (g)]B 2 


\ 

[n*(g)]B*y 


where each [II (g)]g, is the matrix of 11(g) restricted to the subspace Wj. 


(5.64) 
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Thus, given a finite-dimensional representation (n, V ) of a group or Lie algebra, 
we can try to get a handle on it by decomposing V into a direct sum of (two or 
more) invariant subspaces. Each of these invariant subspaces forms a representation 
in its own right, and we could then try to further decompose these representations, 
iterating until we get a decomposition V — Wi © ■ ■ • © Wk in which each of the W, 
have no non-trivial invariant subspaces (if they did, we might be able to decompose 
further). These elementary representations play an important role in the theory, as 
we will see, and so we give them a name: we say that a representation W that has no 
non-trivial invariant subspaces is an irreducible representation (or irrep, for short). 
Furthermore, if a representation (n, V ) admits a decomposition V — Wi ® • • • © 
Wk where each of the Wi are irreducible, then we say that (n, V) is completely 
reducible . l0 If the decomposition consists of two or more summands, then we say 
that V is decomposable. 1 1 Thus we have the following funny-sounding sentence: if 
(n, V ) is completely reducible, then it is either decomposable or irreducible! 

You may be wondering at this point how a finite-dimensional representation 
could not be completely reducible; after all, it is either irreducible or it contains 
a non-trivial invariant subspace W ; cannot we then decompose V into W and some 
subspace W' complementary to IT? We can, but the potential problem is that W' 
may not be invariant; that is, the group or Lie algebra action might take vectors in 
W' to vectors that do not lie in W' . For an example of this, see Problem 5.7. How- 
ever, there do exist many groups and Lie algebras for whom every finite-dimensional 
representation is completely reducible. Such groups and Lie algebras are said to be 
semi-simple . 12 (Most of the matrix Lie groups we have met and their associated 
Lie algebras are semi-simple, but some of the abstract Lie algebras we have seen 
[like the Heisenberg algebra], as well as the matrix Lie groups U ( n ), are not.) Thus, 
an arbitrary finite-dimensional representation of a semi-simple group or Lie algebra 
can by definition always be written as a direct sum of irreducible representations. 
(This is what we did when we wrote M„(R) as M„(M) = M.I © S' n (R) © A„(R), 
though we cannot yet prove that the summands are irreducible.) If we know all the 
irreducible representations of a given semi-simple group or Lie algebra, we then 
have a complete classification of all the finite-dimensional representations of that 
group or algebra, since any representation decomposes into a finite sum of irreps. 
This makes the determination of irreps an important task, which we will complete 
in this chapter for our favorite Lie algebras. 

Example 5.23 Decomposition of tensor product representations 

Say we have a semi-simple group or Lie algebra and two irreps Vi and Vi. The 
tensor product representation V\ <g> Vi is usually not irreducible, but since our group 


10 Note that an irreducible representation is, trivially, completely reducible, since V = V is a de- 
composition into irreducibles. Thus ‘irreducible’ and ‘completely reducible’ are not mutually ex- 
clusive categories, even if they may sound like it! 

1 1 This terminology is not standard but will prove useful. 

12 Semi-simplicity can be defined in a number of equivalent ways, all of which are important. For 
more, see Hall [8] or Varadarajan [ 17]. 
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or Lie algebra is semi-simple we can decompose V\ <g> V 2 as 

Vi ® V 2 = W\ © • • • ® W k 

where the W, are irreducible. In fact, the last decomposition in (5.63) is just the 
matrix version of the <9(n) -invariant decomposition 

T 2 °(K' ! ) = M” 0 K" = Kg © 5 2 '(K") © A 2 (R") (5.65) 

where 

g = 'Y^,e i ®e i (5.66) 

i 

S 2 (R n ) = |fe S 2 (R") I 8ijT l j = 0} where <$,y is Kronecker delta. (5.67) 

Another familiar instance of the decomposition of a tensor product representation 
into irreducibles is when we are dealing with SU(2) representations. As discussed 
in Example 3.21, it is well known that if we have the spin j representation C 2, ' +l = 
V / and spin j' representation C 2 - 7 +1 = V p (we will discuss these representations 
thoroughly in the next section), then their tensor product decomposes as 

Vj <g> V j' = V\j-y\ ® y| / _y '| +1 <g> • • • ® Vj+ji. (5.68) 

The process of explicitly constructing this decomposition is known in the physics 
literature as the “addition of angular momentum’’. 

Example 5.24 Decomposition of A 2 R 4 as an 0(3) representation 

Consider the second rank antisymmetric tensor representation of <9(3, 1) on A 2 ! 1 . 4 , 
but restrict the representation to <9(3 ) where we view <9(3) C <9(3, 1) in the obvious 
way. This representation of (9(3) is reducible, since clearly A 2 R 3 C A 2 R 4 is an 
<9(3)-invariant subspace spanned by 

fl = e 2 A e 3 

h = e 3 A e\ (5.69) 

h = e\ A e 2 . 

There is also a complementary invariant subspace, spanned by 

U = e 1 A £4 

fs = e 2 A e 4 (5.70) 

fe = e 3 A e 4 . 

This second subspace is clearly equivalent to the vector representation of (9(3) since 
(9(3) leaves e 4 unaffected (the unconvinced reader can quickly check that the map 
4> : e; A e 4 1 — >■ a is an intertwiner). Thus, as an (9(3) representation, A 2 R 4 decom- 
poses into the vector and pseudovector representations, i.e. 


2ira4 


3 


2tb>3 


(5.71) 
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We can interpret this physically in the case of the electromagnetic field tensor, which 
lives in A 2 1R 4 ; in that case, (5.71) says that under (proper and improper) rota- 
tions, some components of the field tensor transform amongst themselves as a 
vector, and others as a pseudovector. The components that transform as a vector 
comprise the electric field, and those that transform like a pseudovector comprise 
the magnetic field. To see this explicitly, one can think about which basis vectors 
from (5.70) and (5.69) go with which components of the matrix in (3.62). 

One could, of course, further restrict this representation to 5(9(3) C <9(3, 1); one 
would get the same decomposition (5.71), except that for SO( 3) the representations 
R 3 and A 2 R 3 are equivalent, and so under (proper) rotations the field tensor trans- 
forms as a pair of vectors, the electric field “vector” and the magnetic field “vector”. 
Of course, how these objects really transform depends on what transformation group 
you are talking about! The electric field and magnetic field both transform as vec- 
tors under rotations, but under improper rotations the electric field transforms as a 
vector and the magnetic field as a pseudovector. Furthermore, under Lorentz trans- 
formations the electric and magnetic fields cannot be meaningfully distinguished, 
as they transform together as the components of an antisymmetric second rank ten- 
sor! Physically, this corresponds to the fact that boosts can turn electric fields in one 
reference frame into magnetic fields in another, and vice versa. 

Example 5.25 Decomposition of L 2 (S 2 ) into irreducibles 


A nice example of a direct sum decomposition of a representation into its irreducible 
components is furnished by L 2 (S 2 ). As we noted in Example 5.9, the spectral theo- 
rems of functional analysis tell us that the eigenfunctions Y l m of the spherical Lapla- 
cian A S 2 form an orthogonal basis for the Hilbert space L 2 (S 2 ). We already know, 
though, that the Y m of fixed / form a basis for Hi, the spherical harmonics of de- 
gree /. We will show in the next section that each of the Hi is an irreducible 5(9(3) 
representation, so we can decompose L 2 (S 2 ) into irreducible representations as 


L 2 (S 2 ) = 07Y, = Hi © H 2 © Hi © • • •. 

i 


□ 


Before concluding this section we should point out that the notion of direct sum 
is useful not only in decomposing a given vector space into mutually exclusive sub- 
spaces, but also in ‘adding’ vector spaces together. That is, given two vector spaces 

V and W we can define their direct sum V © W to be the set V x W with vector 
addition and scalar multiplication defined by 

(tfi, Wi) + (1>1, W2) = (ifi + l>2, W\ + W2) 

c(v, w) = ( cv , cw). 

It is straightforward to check that with vector addition and scalar multiplication so 
defined, V © W is a bona fide vector space. Also, V can be considered a subset of 

V © W, as just the set of all vectors of the form ( v , 0), and likewise for W. With this 
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identification it is clear that any element (v, w) e V ® W can be written uniquely as 
v + w with v e V and w e W, so this notion of a direct sum is consistent with our 
earlier definition: if we take the direct sum of V and W, the resulting vector space 
really can be decomposed into the subspaces V and W. 

This notion 13 of direct sum may also be extended to representations; that is, given 
two representations (II; , V,- ), i = 1 , 2, we may construct the direct sum representa- 
tion (Eli © n 2 , V] © V 2 ) defined by 

((rii © n 2 )(g))(i>i, v 2 ) = n 2 (g)u 2 ) v(m, v 2 ) e vj © v 2 , ge g. 

These constructions may seem trivial, but they have immediate physical application, 
as we will now see. 

Example 5.26 The Dirac spinor 

Consider the left- and right-handed spinor representations of SL( 2, C), (II, C 2 ) and 
(n, C 2 ). We define the Dirac spinor representation of SL( 2, C) to be the direct sum 
representation (n © fi, C 2 © C 2 ), which is then given by 

((n©f[)(A))(v,w)= (An, A^io) V(i>, w) e C 2 © C 2 , AeSL{ 2,C). 

Making the obvious identification of C 2 © C 2 with C 4 , we can write (n © 11) (A) 
in block matrix form as 



as in (5.64). 

This representation is, by construction, decomposable. Why deal with a decom- 
posable representation rather than its irreducible components? There are a few dif- 
ferent ways to answer this in the case of the Dirac spinor, but one rough answer has 
to do with parity. We will show in Sect. 5.11 that it is impossible to define a consis- 
tent action of the parity operator on either the left-handed or right-handed spinors 
individually, and that what the parity operator naturally wants to do is interchange 
the two representations. Thus, to have a representation of SL{ 2, C) spinors on which 
parity naturally acts, we must combine both the left- and right-handed spinors into 
a Dirac spinor, and in the most natural cases parity is represented by 



(5.72) 


which obviously just interchanges the left- and right-handed spinors. From this we 
see that the Dirac spinor is reducible under SL( 2, C), but not under larger groups 
which include parity. 


13 In some texts our first notion of direct sum, in which we decompose a vector space into mutually 
exclusive subspaces, is called an internal direct sum, and our second notion of direct sum, in which 
we take distinct vector spaces and add them together, is known as an external direct sum. 
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5.7 More on Irreducibility 

In the last section we introduced the notion of an irreducible representation, but we 
did not prove that any of the representations we have met are irreducible. In this 
section we will remedy that and also learn a bit more about irreducibility along the 
way. In proving the irreducibility of a given representation, the following proposi- 
tion about the irreps of matrix Lie groups and their Lie algebras is often useful. The 
proposition just says that for a connected matrix Lie group, the irreps of the group 
are the same as the irreps of the Lie algebra. This may seem unsurprising and per- 
haps even trivial, but the conclusion does not hold when the group is disconnected. 
We will have more to say about this later. The proof of this proposition is also a nice 
exercise in using some of the machinery we have developed so far. 

Proposition 5.3 A representation (II, V ) of a connected matrix Lie group G is 
irreducible if and only if the induced Lie algebra representation (n, V) of Q is irre- 
ducible as well. 

Proof First, assume that (II, V) is an irrep of G. Then consider the induced Lie 
algebra representation (tc, V), and suppose that this representation has an invariant 
subspace W. We will show that W must be an invariant subspace of (LI, V) as well, 
which by the irreducibility of (II, V) will imply that W is either V or {0}, which 
will then show that (it, V) is irreducible. For all X eg, w e W, we have 


n(X)we W 


=> e K ^ X) weW 
=> n (e x )weW. 


(5.73) 


Now, using the fact that any element of G can be written as a product of exponentials 
(cf. Proposition 5.2), we then have 


n(g)w = Yl[e tlXl e' 2X2 ■ ■ ■ e'" x ")w 

= Tl(e tlXl )n(e r2X2 ) ■ ■ ■ n(e ,llX ")w 


which must be in W by repeated application of (5.73). Thus W is also an invariant 
subspace of (II, V ), but we assumed (II, V ) was irreducible, and so W must be 
equal to V or 0. This then proves that (tt. V) is irreducible as well. 

Conversely, assume that (it, V ) is irreducible, and let W be an invariant subspace 
of (II, V). Then for all t e R, X e g, w e W we know that Yl(e tX )w e W, and hence 



= lim — 

r-»0 t 


II (e ,x )w — w 


must be in W as well, so W is invariant under re and hence must be equal to V 
or {0}. Thus (II, V) is irreducible. □ 
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Note that the assumption of connectedness was crucial in the above proof, as 
otherwise we could not have used Proposition 5.2. Furthermore, we will soon meet 
irreducible representations of disconnected groups (like 0(3. 1)) that yield Lie al- 
gebra representations which do have non-trivial invariant subspaces, and are thus 
not irreducible. The above proposition is still very useful, however, as we will see 
in this next example. 

Example 5.27 The SU ( 2 ) representation on P/(C 2 ), revisited 

In this example we will prove that the su(2) representations (jr/ , P/( C 2 )) of Exam- 
ple 5.7 are all irreducible. Proposition 5.3 will then tell us that the SU( 2) repre- 
sentations (If/, Pj( C 2 )) are all irreducible as well. Later on, we will see that these 
representations are in fact all the finite-dimensional irreducible representations of 
SU( 2)! 

Recall that the n/ (Si ) are given by 

i ( 9 3 \ 

’ t ' W,= 2( a fe7 + Z ‘tej 

1 / 3 3 \ 

MSi)= 2\ zi Hr z 'Hd 

’ r,<w = 5L , to“ a te> 

Define now the “raising” and “lowering” operators 

3 

Y\ = 7Ti(S 2 ) + ini(Si) = -41 — 

dZ2 

3 

X, = -7Tl(S 2 ) + ITTiiSi) = -Z2 — . 

oz i 

It is easy to see that 7/ trades a factor of z 2 for a factor of z i , thereby raising the 
JtiiSi) eigenvalue of a single term by i. Likewise, X / trades a zi for a z 2 and lowers 
the eigenvalue by i. Consider a nonzero invariant subspace W C P/(C 2 ). If we can 
show that W = Pi(C 2 ), then we know that P/( C 2 ) is irreducible. Being nonzero, W 
contains at least one element of the form 

w = a/z[ + ai-\z[~ l Z 2 + ai- 2 z\~ 2 zl H h a 0 z l 2 

where at least one of the a k is not zero. Let Li be the biggest nonzero value of k, so 
that a^z^Zj k ° is the term in w with the highest power of zi . Then applying ( Xi) k ° 


14 You may object to the use of i in our definition of these operators; after all, su(2) is a real Lie 
algebra, and so the expression S 2 + i Si has no meaning as an element of su(2), and so one cannot 
say that, for instance, Y/ = jt(S 2 + i Si). Thus X / and T; are not in the image of su(2) under 717 . 
This is a valid objection, and to deal with it one must introduce the notion of the complexification 
of a Lie algebra. A discussion of this here would lead us too far astray from our main goals of 
applications in physics, however, so we relegate this material to the appendices which you can 
consult at leisure. 
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to w lowers the zi degree by ko, killing all the terms except ak 0 z\°z 2 k ° ■ In fact, one 
can compute easily that 

xf°K4 0 4“ A ’°) = (-l)Ho'ai-o4 

This is proportional to z\, and since W is invariant W must then contain z 2 - But 
then we can successively apply the raising operator Y/ to get monomials of the form 
z k t z l f~ k for 0 < k < l, and so these must be in W as well. These, however, form a 
basis for Pf C 2 ), hence W must equal P/( C 2 ), and so PfC 2 ) is irreducible. □ 

Before moving on to our next examples, we need to state and prove Schur’s 
lemma, one of the most basic and crucial facts about irreducible representations. 
Roughly speaking, the upshot of it is that any linear operator on the representation 
space of an irrep which commutes with the group action (i.e. is an intertwiner) must 
be proportional to the identity. The precise statement is as follows: 

Proposition 5.4 (Schur’s lemma) Let (If/, V)), i — 1,2 be two irreducible repre- 
sentations of a group or Lie algebra, and let 0 : V\ — > V 2 be an intertwiner. Then ei- 
ther 0 = 0 or tp is a vector space isomorphism. Furthermore, if(Y 1 1 , V\) — (IT 2 , V 2 ) 
and V] is a finite-dimensional complex vector space, then (p is a multiple of the iden- 
tity, i.e. (f> — cl for some c e C. 

Proof We will prove this for group representations FI, ; the Lie algebra case follows 
immediately with the obvious notational changes. Let K be the kernel or null space 
of 0 . Then K C V) is an invariant subspace of n 1 , since for any v e K , g e G, 

</>(ni(g)u) = n 2 (g)(0(u)) 

= n 2 (0) 

= 0 

=>• flj (g)i) e K. 

However, since (n 1 , Vi ) is irreducible, the only invariant subspaces are 0 and V] , so 
K must be one of those. \f K = V\ then <fi — 0, so henceforth we assume that K = 0, 
which means that f is one-to-one, and so f ( V \ ) C V 2 is isomorphic to V\ . Further- 
more, <p(V 1 ) is an invariant subspace of (TL, Vf), since for any <f>(v) e <p(V\), g e G 
we have 


(n 2 (g))(0(w)) = 0(n 1 (g)i;)e0(Vi). 

But (n 2 , V 2 ) is also irreducible, so <fi(V 1 ) must equal 0 or V 2 . We already assumed 
that <p{V\) ^ 0, though, so we conclude that <p(V\) = V 2 and hence </> is an isomor- 
phism. 

Now assume that (11 1 , V) ) = ( n 2 , V 2 ) = (IT, V) and that V is a finite-dimen- 
sional complex vector space of dimension n (notice that we did not assume finite- 
dimensionality at the outset of the proof). Then 0 is a linear operator and the eigen- 
value equation 
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det (0 — XI) 

is a nth degree complex polynomial in X. By the fundamental theorem of algebra, 15 
this polynomial has at least one root c e C, hence <j> — cl has determinant 0 and 
is thus non-invertible. This means that <j> — cl has a non-trivial kernel K . K is 
invariant, though; for all v e K. 

W> - c/)(n(g)u) = </>(n(g)u) - cn(g)u 

= n(g)(0(u) - cv) 

= 0 

and since we know K ^ 0, we then conclude by irreducibility of V that K — V . 
This means that (0 — cl)(v) = 0 Vu e V, which means 0 = cl, as desired. □ 

Exercise 5.33 Prove the following corollary of Schur’s lemma: If (13;, V;), i = 1,2 are 
two complex irreducible representations of a group or Lie algebra, and 0, 0 : V\ — »■ Vi are 
two intertwiners with f f 0, then 0 = cf for some c G C. 

As a first application of Schur’s lemma, we have 

Proposition 5.5 An irreducible finite-dimensional complex representation of an 
abelian group or Lie algebra is one-dimensional. 

Proof Again we prove only the group case. Since G is abelian, each 11 (g) com- 
mutes with 11(h) for all h <= G, hence each 11 (g) : V — > V is an intertwiner! By 
Schur’s lemma, this implies that every 11(g) is proportional to the identity (with 
possibly varying proportionality constants), and so every subspace of V is an invari- 
ant one. Thus the only way V could have no non-trivial invariant subspaces is to 
have no non-trivial subspaces at all, which means it must be one-dimensional. □ 

Exercise 5.34 Show that the fundamental representation of 50(2) on R 2 is irreducible. 
Prove this by contradiction, showing that if the fundamental representation were reducible 
then the 50(2) generator 



would be diagonalizable over the real numbers, which you should show it is not. This shows 
that one really needs the hypothesis of a complex vector space in the above proposition. 

Example 5.28 The irreducible representations of 7*2 

Proposition 5.5 allows us to easily enumerate all the irreducible representations 
of 7*2. Since 7i is abelian any irreducible representation (nin-, V ) must be one- 
dimensional (i.e. V — R or C), and must also satisfy 


15 See Herstein [9], for instance. 


188 


5 Basic Representation Theory 


(n irr (-i)) 2 = n irr ((-i) 2 ) = n irr (i) = 1 , 

which means that !!;„•(— 1) = ±1, i.e. n nl is either the alternating representation or 
the trivial representation! Furthermore, 'Ll is semi-simple (as you will show in Prob- 
lem 5.8), and so any representation (FI, V) of Z 2 is completely reducible and thus 
decomposes into one-dimensional irreducible subspaces, on which n(— 1) equals 
either 1 or —1. Thus n(— 1) is diagonalizable, with eigenvalues ±1 (cf. Exam- 
ple 5.18 for examples of this). Let us say the L 2 in question is Z 2 — {/, P} C 0(3), 
and that we are working in a quantum mechanical context with a Hilbert space Tt. 
Then there exists a basis for 'H consisting of eigenvectors of n(P); for a given 
eigenvector 1 jr, its eigenvalue of ±1 is known as its parity. If the eigenvalue is +1 
then 1 jr is said to have even parity, and if the eigenvalue is — 1 then \Jr is said to have 
odd parity. □ 


5.8 The Irreducible Representations of su( 2), 517(2) and 50(3) 

In this section we will construct (up to equivalence) all the finite-dimensional ir- 
reducible complex representations of su(2). Besides being of intrinsic interest, our 
results will also allow us to classify all the irreducible representations of SO( 3), 
SU(2), and even the apparently unrelated representations of so(3, 1), SO( 3, 1) and 
SL( 2, C). The construction we will give is more or less the same as that found in the 
physics literature under the heading “theory of angular momentum”, except that we 
are using different language and notation. Our strategy will be to use the commuta- 
tion relations to deduce the possible structures of su(2) irreps, and then show that 
we have already constructed representations which exhaust these possibilities, thus 
yielding a complete classification. 

Let (tt, V ) be a finite-dimensional complex irreducible representation of su(2). 
It will be convenient to use the following shorthand, familiar from the physics liter- 
ature: 


J z = in(S z ) 

J+ = in(S x ) — jr(S y ) (5.74) 

J- = irc(S x ) + jr(Sy). 

These “raising” and “lowering” operators obey the following commutation relations, 
as you can check: 


7 ±] = ±J± 

[J+,J-] = 2J Z . 

Now, as discussed in our proof of Schur’s lemma, the fact that V is complex means 
that every operator on V has at least one eigenvector. In particular, this means that 
J z has an eigenvector v with eigenvalue b. The above commutation relations then 
imply that 
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J z (J±v ) = [J z , J±]v + J±{J Z v) = (b ± l)/±i> 

so that if J±v is not zero (which it might be!), then it is another eigenvector of J z 
with eigenvalue b ± 1 . Now, we can repeatedly apply J + to v to yield more and more 
eigenvectors of J z , but since V is finite-dimensional and eigenvectors with different 
eigenvalues are linearly independent (see Exercise 5.35 below), this process must 
end somewhere, say at N applications of J + . Let vq be this vector with the highest 
eigenvalue (also known as the highest weight vector of the representation), so that 
we have 

v 0 = (J+) N v 
J+vq = 0. 

Then Vq has a J z eigenvalue of h + /V = j (note that so far we haven’t proved any- 
thing about b or j, but we will soon see that they must be integral or half-integral). 
Starting with vq, then, we can repeatedly ‘lower’ with /_ to get eigenvectors with 
lower eigenvalues. In fact, we can define 

Vk = (J-) k V 0 

which has J z eigenvalue j — k. This chain must also end, though, so there must exist 
an integer / such that 

v t = (■/_)' u 0 ^0 but vi+i = (/_) ,+1 u 0 = 0. 

How can we find /? For this we will need the following formula, which you will 
prove in Exercise 5.36 below: 

J+{v k ) = [2jk-k{k- l)]u*_i. (5.75) 

Applying this to vi+\ — 0 gives 

0 = J+(v m ) = [2j(l + 1) - (Z + 1)Z>; 

and since u/ / 0 we conclude that 

[2;(/+D — (/+l)/] = 0 <=► j — 1/2. 

Thus j is a nonnegative integer or half integer! (In fact, j is just the ‘spin’ 
of the representation.) We further conclude that V contains 2j + 1 vectors 
{Lfc I 0 < k < 2j] all of which are eigenvectors of J z with eigenvalue j — k. Further- 
more, since 

tt(S x ) = — -(/+ + J~) 

the action of S x and S y take a given Vk into a linear combination of other i so the 
span of Vk is a nonzero invariant subspace of V . We assumed V was irreducible, 
however, so we must have V = Spanju*}, and since the Vk are linearly independent, 
they form a basis for V ! 
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To summarize, any finite-dimensional irreducible complex representation of 
su(2) has dimension 2j + 1, 2j e Z and a basis {v* }*=o — 2 / which satishes 

J+(vo) = 0. 

J-(.Vk) = Vk+ 1, fe < 2j 

J z (vk) = (j ~ k)v k (5.76) 

J-(vij) = 0 

J+(v k ) = [2jk-k(k- l)]ujt-i, 

What is more, we can actually use the above equations to define representations 
(it j, Vj), where Vj is a (2 j + 1) -dimensional vector space with basis {vk} k =o- 2 j 
and the action of the operators Jtj(Sj) is defined by (5.76). It is straightforward to 
check that this defines a representation of su(2) (see exercise below), and one can 
prove irreducibility in the same way that we did in Example 5.27. Furthermore, any 
irrep (it, V ) of su(2) must be equivalent to (jtj, Vj) for some j, since we can find a 
basis w k for V satisfying (5.76) for some j and then define an intertwiner by 

u)k h* Vk 

and extending linearly. We have thus proved the following: 

Proposition 5.6 The su(2) representations (jtj , Vj), 2 j e Z defined above are all 
irreducible, and any other finite-dimensional complex irreducible representation of 
su(2) is equivalent to (jtj , V j) for some j , 2j e Z. 

In other words, the (rtj, Vj) are, up to equivalence, all the finite-dimensional 
complex irreducible representations of su(2). They are also all the finite-dimen- 
sional complex irreducible representations of 50(3), since bu( 2) ~ so(3). 

If you look back over our arguments you will see that we deduced (5.76) from 
just the su(2) commutation relations, the finite-dimensionality of V, and the exis- 
tence of a highest weight vector i>o satisfying J + (vo) = 0 and J z (i>o) = jv o . Thus, if 
we have an arbitrary (i.e. not necessarily irreducible) finite-dimensional su(2) rep- 
resentation (it, V) and can find a highest weight vector vo for some j, we can lower 
with J- to generate a basis {v k }k=0-2j satisfying (5.76) and conclude that V has an 
invariant subspace equivalent to (jtj, Vj). We can then repeat this until V is com- 
pletely decomposed into irreps. If we know that (n, V) is irreducible from the start 
then we do not even have to find the vector vq, we just use the fact that (it, V) must 
be equivalent to (jtj, Vj) for some j and note that j is given by j = ^(dim V — 1). 
These observations make it easy to identify which (it j, Vj) occur in any given su(2) 
representation. 

Note that if we have a finite-dimensional complex irreducible su(2) representa- 
tion that is also unitary, then we could work in an orthonormal basis. In that case, 
it turns out that the Vk defined above are not orthonormal, and are thus not ideal 
basis vectors to work with. They are orthogonal, but are not normalized to have 
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unit length. In quantum mechanical contexts the su(2) representations usually are 
unitary, and so in that setting one works with the orthonormal basis vectors | m), 
—j< m < j ■ The vector \m) is proportional to our v ; _ m , and is normalized whereas 
Vj- m is not. See Sakurai [ 14] for details on the normalization procedure. 

Exercise 5.35 Let S = [Vi\i = i-k be a set of eigenvectors of some linear operator T on a 
vector space V. Show that if each of the i>; have distinct eigenvalues, then S is a linearly 
independent set. (Hint: One way to do this is by induction on k. Another is to argue by 
assuming that S is linearly dependent and reaching a contradiction. In this case you may 
assume without loss of generality that the u; , 1 < i < k — 1 are linearly independent, so that 
Vk is the vector that spoils the assumed linear independence.) 

Exercise 5.36 Prove (5.75). Proceed by induction, i.e. first prove the formula for k = 1, 
then assume it is true for k and show that it must be true for k + 1 . 

Exercise 5.37 Show that (5.76) defines a representation of su(2). This consists of showing 
that the operators /+, J-, J z satisfy the appropriate commutation relations. Then show that 
this representation is irreducible, using an argument similar to the one from Example 5.27. 

Example 5.29 P/( C 2 ), revisited again 

In Example 5.7 we met the representations (iti, P/(C 2 )) of su(2) on the space of 
degree / polynomials in two complex variables. In Example 5.27 we saw that these 
representations are all irreducible, and so by setting dimP/(C 2 ) = / + 1 equal to 
2j + 1 we deduce that 


(iti, P/(C 2 )) ~ Vi/ 2 ) 


(5.77) 


and so the (tt/, P/(C 2 )), / e Z also yield all the complex finite-dimensional ir- 
reps of su(2). What is more, this allows us to enumerate all the finite-dimensional 
complex irreps of the associated group SJJ(2). Any irrep (II, V ) of SU(2) yields 
an irrep (it, V ) of su(2), by Proposition 5.3. This irrep must be equivalent to 
(777 , P/(C 2 )) for some / e Z, however, and so by Proposition 5.2 (II, V ) is equiv- 
alent to (II/, P/(C 2 )). Thus, the representations (II/, P/(C 2 )), I e Z are (up to 
equivalence) all the finite-dimensional complex irreducible representations of 
5(7(2)! 

It is instructive to construct the equivalence (5.77) explicitly. Recall that the rais- 
ing and lowering operators (which we called Y\ and X / in Example 5.27) are given 
by 



and that 



It is easy to check that vq = z l 2 is a highest weight vector with j — I /2, and so the 
basis that satisfies (5.76) is given by 
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«t = (y_)*(4) = (-l) t ^— 44"*, 0 <k<l. (5.78) 

Exercise 5.38 Show by direct calculation that ( 712 j , P 2 j (C 2 )) satisfies (5.76) with basis 
vectors given by (5.78). 

Example 5.30 5 2/ (C 2 ) as irreps of su(2) 


We suggested in Example 5.15 that the tensor product representation of 
su(2) on 5 2/ '(C 2 ), the totally symmetric (0,2 j) tensors on C 2 , is equivalent to 
C Tt 2 j , P 2 j(C 2 )) — {nj, Vj), 2 j eZ. With the classification of su(2) irreps in place, 
we can now prove this fact. It is easy to verify that 


i>o = ei <S> • • • <S> ei e 5 2/ (C 2 ) (5.79) 

^ v 

2 j times 

is a highest weight vector with eigenvalue j, so there is an irreducible invariant 
subspace of 5 2/ (C 2 ) equivalent to Vj. Since dim Vj — dim .S' 2 -' (C 2 ) = 2j + 1 (as 
you can check), we conclude that Vj is all of 5 2/ (C 2 ), and hence that 


{S 2j n, 5 2/ (C 2 )) ~ (Try, Vj) ~ {it 2j , P 2j { C 2 )) 


(5.80) 


which also implies (S 2 m, S 2 i (C 2 )) ~ {Yl 2j , P 2j (C 2 )). Thus, we see that every 
irreducible representation of su(2) and SU (2) can be obtained by taking a sym- 
metric tensor product of the fundamental representation. Thus, using nothing 
more than the fundamental representation (which corresponds to j = 1/2, as ex- 
pected) and the tensor product, we can generate all the representations of su(2) 
and SU( 2). We will see in the next section that the same is true for 50(3, 1) and 
SO( 3, 1) 0 . □ 


Our results about the finite-dimensional irreps of su(2) and 51/(2) are not 
only interesting in their own right; they also allow us to determine all the ir- 
reps of 50(3)! To see this, first consider the degree / harmonic polynomial rep- 
resentation (n,,7f/(M 3 )) of 50(3). This induces a representation (jzi,Hi(S?)) of 
50(3) ~ su(2). It is easy to check (see exercise below) that if we define, in analogy 
to the su(2) case, 

J z = ini(L z ) = i 
J . f = i7t(L x ) — JT 
J- = iTt(L x ) + 7 r 

then the vector /o = (x + iy) 1 is a highest weight vector with eigenvalue /, and 
so 'Hi (R 3 ) has an invariant subspace equivalent to ( 717 , V/). We will argue in Prob- 
lem 5.9 that dim7-(/(R 3 ) = 21 + 1, so we conclude that (717, 7 -(/(K 3 )) ~ ( 717 , V/), and 
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is hence an irrep of so(3). Proposition 5.3 then implies that (If/, 7Y/(R 3 )) is an irrep 
of 50(3)! 

Are these all the irreps of SO (3)1 To find out, let (11, V) be an arbitrary finite- 
dimensional complex irrep of 5(9(3). Then the induced so(3) ~ su(2) representation 
(tt, V) must be equivalent to (jTj , Vj) for some integral or half-integral j . We just 
saw that any integral j value is possible, by taking (n, V) — (n 7F ;(R 3 )). What 
about half-integral values of jl In this case, we have (careful not to confuse the 
number tt with the representation jt !) 

e 2jtjr( L z ) Vo = e ~ i27lJz vo 

— e~ l27T J vq since vq has eigenvalue j 
= —vq since j half-integral. 

However, we also have 

e 2n-x(L z ) Vo _ (n(e 2jr ' iz ))u 0 

= (n(/))vo by (4.86) 

= Vo, 


a contradiction. Thus, j cannot be half-integral, so (tt, V ) must be equivalent to 
(tt j , Vj) for j integral. But this implies that (tt, V ) ~ (nj,'Hj( 1R 3 )), and so by 
Proposition 5.2, (FI, V) is equivalent to (Il 7 -, 7f ; (K 3 ))! Thus, the representations 
(H j,Hj(Rl))J e Z are (up to equivalence) all the finite-dimensional complex 
irreducible representations of S0(3). 

An important lesson to take away form this is that for a matrix Lie group G with 
Lie algebra g, not all representations of g necessarily come from representations 
of G. If G = 5(7(2 ) then there is a one-to-one correspondence between Lie algebra 
representations and group representations, but in the case of S0( 3) there are Lie 
algebra representations (corresponding to half-integral values of j ) that do not come 
from 5(9(3) representations. We will not say much more about this here, except to 
note that this is connected to the fact that there are non-isomorphic matrix Lie groups 
that have isomorphic Lie algebras, as is the case with 5(7(2) and 5(9(3). For a more 
complete discussion, see Hall [8], 

Exercise 5.39 Verify that 


J+fo = 0 
Jzfa = l fa- 


5.9 Real Representations and Complexifications 

So far we have classified all the complex finite-dimensional irreps of su(2), 5(7(2) 
and 5(9(3), but we haven’t said anything about real representations, despite the fact 
that many of the most basic representations of these groups and Lie algebras (like 
the fundamental of 5(9(3) and all its various tensor products) are real. Fortunately, 
there is a way to turn every real representation into a complex representation, so 
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that we can then apply our classification of complex irreps. Given any real vector 
space V, we can define the complexification of V as Vr = C 0 V , where C and 
V are thought of as real vector spaces (C being a 2-dimensional real vector space 
with basis {1, ;}), so that if {e,} is a basis for V then {1 0 e,-, i 0 e,j is a (real) 
basis for Vc- Note that Vc also carries the structure of a complex vector space, with 
multiplication by i defined by 

i(z 0v) = ( iz ) <S>v, z e C, v e V. 

A complex basis for Vc is then given by { 1 <g> e, }, and the complex dimension of Vic 
is equal to the real dimension of V. 

We can then define the complexification of a real representation (n, V) to be the 
(complex) representation (lie, Vc) defined by 

(nc(g))(z® v)=z® n(g)u. (5.81) 

We can then get a handle on the real representation (II, V) by applying our clas- 
sification scheme to its complexification (lie, Vc). You should be aware, however, 
that the irreducibility of (II, V) does not guarantee the irreducibility of (lie, Vc) 
(see Exercise 5.40 and Example 5.39), though in many cases (lie, Vc) will end up 
being irreducible. 

Example 5.31 The complexifications of R" and M„ (M) 

As a warm-up to considering complexifications of representations, we consider the 
complexification of a simple vector space. Consider the complexification Rjjf of M" . 
We can define the (obvious) complex-linear map 

<p : ->■ C" 

1 0 (a-'e/) + i 0 (b^ej) m- (a- 7 + ib^ej, af b' e M 

which is easily seen to be a vector space isomorphism, so we can identify R£, 
with C" . One can also extend this argument in the obvious way to show that 

(M„(R)) C = M„(C). 


Example 5.32 The fundamental representation of so (3) 

Now consider the complexification (irc,®^) of the fundamental representation of 
50(3). As explained above, can be identified with C 3 , and a moment’s consider- 
ation of (5.81) will show that the complexification of the fundamental representation 
of so(3) is just given by the usual so(3) matrices acting on C 3 rather than K 3 . You 
should check that J z and / + are given by 



0 -i 0 

1 0 0 

0 0 0 
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and that the vector 



vo = e\+ ie 2 — 


is a highest weight vector with j = 1. This, along with the fact that dimC 3 = 3, al- 
lows us to conclude that (jrc, R^) — Ctti , Vj). This is why regular three-dimensional 
vectors are said to be “spin-one”. Notice that our highest weight vector vq = e\ + ie 2 
is just the analog of the function /o = x + iy e Hi (R 3 ), which is an equivalent rep- 
resentation, and that in terms of SO( 3) reps we have 


(nc,R;c) ~(n!,Wi(M 3 )). 

Recall also that the adjoint representation of su(2) ~ so (3) is equivalent to the 
fundamental representation of so (3). This combined with the above results implies 


(ad c ,so(3) c ) - (ad c ,su(2) c ) - (m, V\) 

so the adjoint representation of su(2) is “spin-one” as well. 


Example 5.33 Symmetric traceless tensors 

Consider the space .S'' 2 ( R 3 ) of symmetric traceless second rank tensors defined in 
(5.67). This is an SO( 3) invariant subspace of R 3 <g> R 3 , and so furnishes a repre- 
sentation ( n . S 2 ' (R 3 )) of SO (3). What representation is this? To find out, consider 
the complexification of the associated so(3) representation, (7Tq C , S 2 (C 3 )), where 
S 2 (C 3 ) is also defined by (5.67), just with C 3 replacing R 3 . Then it is straightfor- 
ward to verify that 


00 = (e\ + ie 2 ) <g> (e\ + ie 2 ) 

is a highest weight vector with j — 2, and this along with fact that dim S 2 ' (C 3 ) = 5 
(check!) implies that 

(^OC’ ^ 2 (C 3 )) — ( 7T2, V 2 ). 

This is why symmetric traceless second rank tensors on R 3 are sometimes said to 
be “spin-two”. 

Recall that S 2 (R 3 ) arose in Example 5.23 as part of the decomposition 
R 3 ® R 3 = Rg ® A 2 (R 3 ) © S 2 ' (R 3 ) 
which has matrix counterpart 

M 3 (R) = R / © A 3 (R) © 5 3 (R). 
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Complexifying and using the fact (cf. Example 5.22) that A 2 (K 3 ) is equivalent to 
the adjoint representation and is hence “spin-one”, we obtain the following decom- 
position: 


Vi <g> Vi ~ C 3 <g> C 3 


M 3 ( C) ~ Vo ® El © v 2 . 


This is an instance of (5.68) and should be familiar from angular momentum addi- 
tion in quantum mechanics. 

Exercise 5.40 Consider the fundamental representation of 50(2) on R 2 , which we know 
is irreducible by Exercise 5.34. The complexification of this representation is just given 
by the same 50(2) matrices acting on C 2 rather than R 2 . Show that this representation is 
reducible, by diagonalizing the 50(2) generator 



Note that this diagonalization can now be done because both complex eigenvalues and com- 
plex basis transformations are allowed, in contrast to the real case. 


5.10 The Irreducible Representations of sl(2, C)r, SL( 2, C) and 
SO( 3, 1) 0 


In this section we will use the techniques and results of the previous section to 
classify all the finite-dimensional complex irreps of sl(2, C)® ~ so(3, 1), and then 
use this to find the irreps of the associated groups SL( 2, C) and 50(3, l) c . 

Let (jt, V ) be a finite-dimensional complex irreducible representation of 
s 1(2, C) R . Define the operators 



(5.82) 


where {5/, Ki] 1 = 1 , 2 , 3 is our usual basis for s 1(2, C)r. One can check that the Ms and 
Ns commute between each other, as well as satisfy the su(2) commutation relations 
internally, i.e. 


[Mi,Nj]= 0 


3 



(5.83) 


k=i 

3 


[Ni,Nj] = J2^jkN k . 


We have thus taken the complex span of the set [jr(Kj), Jt(Si)} (notice the factors 
of i in (5.82)) and found a new basis for this Lie algebra of operators that makes it 
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look like two commuting copies of su(2). We can thus define the usual raising and 
lowering operators 


N± = iN i =F Ni 
M± = iM\ M 2 

which then have the usual commutation relations between themselves and i M z ,iN z : 


[i M z , M±] = ±M± 

[iN z , N±] = ±N± 

[M + , M_] = 2iM~ 

[N+,N-] = 2iN z . 

With this machinery set up we can now use the strategy from the last section. First, 
pick a vector v e V that is an eigenvector of both i M z and i N z (that such a vector 
exists is guaranteed by Problem 5.10). Then by applying M + and N+ we can raise 
the i M z and i N z eigenvalues until the raising operators give us the zero vector; let 
i>0,o denote the vector with the highest eigenvalues (which we will again refer to as 
a ‘highest weight vector’), and let (j i, j 2 ) denote those eigenvalues under iM z and 
i N z , respectively, so that 


M+(uo,o) = 0 
N+(v 0 ,o) — 0 
iM z [v o,o) = ji To.o 
iN z (v 0 ,o) = y 2 %o- 

We can then lower the eigenvalues with N- and M_ to get vectors 

v hM = (M-) k \N-) k2 v o,o 


(5.84) 


(5.85) 


which are eigenvectors of iM z and iN z with eigenvalues j i — k\ and ji — kj, re- 
spectively. By finite-dimensionality of V this chain of vectors must eventually end, 
though, so there exist nonnegative integers l \ , Z 2 such that i;/, ,/ 2 ^ 0 but 


M-(v h j 2 ) = N-{v h j 2 ) = 0. (5.86) 

Calculations identical to those from the su(2) case show that /,- = 2 j , , and that the 
action of the operators M , , N, is given by 


M + (u 0 ,o) = 0 
N+(v o,o) = 0 

M-(vk u k 2 ) = Vk!+i,k 2 , k\ < 2;'i 

N-(vkuk 2 ) = Vk u k 2 +l, k 2 < 2 72 
iM z (vk u k 2 ) = O'l - ki)vk u k 2 
iN z (vk u k 2 ) = (72 - k 2 )vk u k 2 
M-(v 2ju k 2 ) = 0, Vk 2 


(5.87) 
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N-(v ku2 j 2 ) = 0, Vki 

M +( vk u k 2 )= [271*1 - * i(*i - *1 ^0 

N+(Vki,k 2) — [272*2 - *2(*2 - l)]ujfci,jfe 2 — 1. *2^0 


(this is just two copies of (5.76), one each for the M,- and the Nj). As in the su(2) 
case, we note that the {v kuk2 \ are linearly independent and span an invariant sub- 
space of V, hence must span all of V since we assumed V was irreducible. Thus, 
we conclude that any complex finite-dimensional irrep of sl(2, C)r is of the form 
("Ui.h)’ v Vuh)) where v Uuh) has a basis 


B = {vk u k 2 I 0 < *1 < 2j\ , 0 < *2 < 272 } 


and the operators tz ( j l ,j 2 )(Si)-> n (j],h)(Ki) satisfy (5.87), with 2 jj , 2 jo e Z. This 
tells us that 


dim v UiJ2) = ( 2 7 1 + 1)( 2 72 + !)• 


Let us abbreviate the representations Ctr(;jj 2 ), V(j 1 j 2 )) as simply (ji , j 2 ) as is 
done in the physics literature. As in the su(2) case, we can show that (5.87) actually 
defines a representation of sl(2, <C)r, and using the same arguments that we did in 
the su( 2 ) case we conclude that 

Proposition 5.7 The representations (ji, j2)» 2./i . 2/2 € Z are, up to equivalence, 
all the complex finite-dimensional irreducible representations ofsl(2, C)r. 

As in the su(2) case, we deduced (5.87) from just the sl(2, C)r commutation re- 
lations, the finite-dimensionality of V , and the existence of a highest weight vector 
i>o,o satisfying (5.84). Thus if we are given a finite-dimensional sl(2, C)r represen- 
tation and can find a highest weight vector uo.o for some (yj, 72 ), we can conclude 
that the representation space contains an invariant subspace equivalent to (ji , j2) - 

Example 5.34 (jc, C 2 ): The fundamental (left-handed spinor) representation 

Consider the left-handed spinor representation (it, C 2 ) of s 1(2, C)r, which is also 
just the fundamental of 51(2, C)r, i.e. 
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* ( «=M0 -O' 

In this case we have (check!) 

Mi = Si 
Ni= 0 

and hence 

Mo 0 

M~ = f J o) (5.88) 

N+ = N-= 0. 

With this in hand you can easily check that (1, 0) e C 2 is a highest weight vector 
with j\ = 1/2, p = 0. Thus the fundamental representation of s!(2,C)r is just 
(], 0 ). 

Example 5.35 ( S 2j jt, S 2 ' (C 2 )): Symmetric tensor products of left-handed spinors 

As with su(2), we can build other irreps by taking symmetric tensor products. Con- 
sider the 2 yth symmetric tensor product representation (S 2 - 7 : n, S 2 i (C 2 )), 2j e Z and 
the vector 


vo,o = *i ® ■ ■ ■ ® et. e S 2j (C 2 ) . (5.89) 

^ v* ^ 

2 j times 

Using (5.34) it is straightforward to check, as you did in Example 5.30, that this is 
a highest weight vector with eigenvalues (j, 0), and so we conclude that S 2 -* {C 2 ) 
contains an invariant subspace equivalent to (j, 0). Noting that 

dim S 2 j (C 2 ) = dim(j, 0) = 2j + 1 

we conclude that (S 2 ^n, 5 2j ( C 2 )) ^ (j, 0). 

Example 5.36 (it , C 2 ): The right-handed spinor representation 

Consider the right-handed spinor representation (O, C 2 ) from Example 5.6. A quick 
calculation (do it!) reveals that the induced Lie algebra representation (jt . C 2 ) is 
given by 

if(A) = -A f , X e sl(2, C)r. 

In particular we then have 


7t(Si) = Si 


since the S, are anti-Hermitian, as well as 
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jt(Ki) — 7r (i 'Si) — —ijr(Sj) — —iSj. 

We then have 


and hence 


Mi — 0 
Ni = Si 


N+ = 
N- = 


M+ = M-= 0. 


(5.90) 


You can again check that (1, 0) e C 2 is a highest weight vector, but this time with 
j i = 0, 72 = 1/2, and so the right-handed spinor representation of sl(2, C)r is just 


( 0 , 


Example 5.37 (rtij, S 2 -* (C 2 )): Symmetric tensor products of right-handed spinors 


As before, we can build other irreps by taking symmetric tensor products. Again, 
consider the 2 7 th symmetric tensor product representation ( 5 ’ 2 - / 7 r, S 2/ (C 2 )), 2j e Z 
and the vector 

u 0 ,o = ei ® • • • ® ei e S 2j (C 2 ). (5.91) 

2 j times 

Again, it is straightforward to check that this is a highest weight vector with eigen- 
values (0, j), and we can conclude as before that (S 2 - , jf, 5’ 2 - / (C 2 )) is equivalent to 
(0, j). □ 


So far we have used symmetric tensor products of the left-handed and right- 
handed spinor representations to build the (j, 0) and (0, j) irreps. From here, getting 
the general irrep (ji , j 2 ) is easy; we just take the tensor product of (ji, 0 ) and ( 0 , j 2 ) ! 
To see this, let i>o,o € (j, 0) and uq.o e (0, k) be highest weight vectors. Then it is 
straightforward to check that 

i’o,o ® vo,o e (j- 0) ® (0, k) (5.92) 

is a highest weight vector with 71 = 7, 72 = k, and so we conclude that (j, 0 ) <g> ( 0 , k) 
contains an invariant subspace equivalent to (j, k). However, since 

dim[(j, 0 ) <g> ( 0 , k)] = dim(j, k) = (2 j + 1)(2 k + 1 ) 

we conclude that these representations are equivalent, and so in general (switching 
notation a little), 

(jlj 2 )-(jl, 0 )<g>( 0 ,j 2 ). 


(5.93) 
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Thus, all the irreps of sl(2, <C)® can be built out of the left-handed spinor 
(fundamental) representation, the right-handed spinor representation, and 
various tensor products of the two. 

As before, this classification of the complex finite-dimensional irreps of sl(2, C)r 
also yields, with minimal effort, the classification of the complex finite-dimensional 
irreps of SL( 2, C). Any complex finite-dimensional irrep of SL(2. C) yields a com- 
plex finite-dimensional irrep of sl(2, C)*, which must be equivalent to (ji , j2) for 
some 71, 72 and since 

(ji, j 2 ) - (s 2h iz ® s 2h a, s 2jl ( c 2 ) <g> s 2h ( c 2 )) 

we conclude that our original SL( 2,C) irrep is equivalent to (5 2,l n ® 5 2j2 fl , 
S 2 ' l (C 2 ) ® S 2 i 2 (C 2 )) for some 71, 72. Thus, the representations 

(S 2jl n <g> S 2j2 n , S 2jl (C 2 ) <g> S 2 -' 2 (C 2 ) ) , 2j l ,2j 2 eZ 

are (up to equivalence) all the complex finite-dimensional irreducible represen- 
tations of SL( 2, C). 

How about representations of 5(9(3, 1)„? We saw that in the case of 5(9(3), not all 
representations of the associated Lie algebra actually arise from representations of 
the group, and the same is true here. Say we have a complex finite-dimensional irrep 
(IT, V ) of 5(9(3, 1) 0 , and consider its induced Lie algebra representation (k, V), 
which must be equivalent to (ji, j 2 ) for some 71, 72. Noting that 

iM z + iN z =in(S z ) 

we have (again, be sure to distinguish n the number from it the representation!) 

i2n-(iM z +iN z ) ,, J2n(j\ + /o) ,, 

e V z/ V Qj ) = e 

as well as 

e i2n-(iM z +iN z ) Voo = e -2n-n(S z ) V00 

= n( e - 27r52 )u 0 ,o 
= n(/)i> 0 ,o 

= K0,0 

so we conclude that 

^Oi+L) = 1 + 72 e Z, (5.94) 

and thus only representations (ji,j 2 ) satisfying this condition can arise from 
5(9(3, 1) 0 representations. (It is also true that for any j\, 72 satisfying this condi- 
tion, there exists an 5(9(3, 1) 0 representation with induced Lie algebra representa- 
tion (ji , j 2 ), though we will not prove that here.) 

Example 5.38 R 4 : The four-vector representation of 5(9(3, 1) 0 

The fundamental representation ( IT . R 4 ) is the most familiar 5(9(3, 1)„ repre- 
sentation, corresponding to four-dimensional vectors in Minkowski space. What 
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(jl , j 2 ) does it correspond to? To find out, we first complexify the representation to 
( n c , C 4 ) and then consider the induced sl(2, Or representation (tcq, C 4 ). Straight- 
forward calculations show that 


(0 

—i 

0 

°\ 


i 

0 

0 

0 


0 

0 

0 

1 



0 

1 

V 


/° 

—i 

0 


0 \ 

i 

0 

0 


0 

0 

0 

0 


-1 

^0 

0 

-1 


0 ) 


and this, along with expressions for M + and N + that you should derive, can be used 
to show that 


/ 1 \ 


to.O = (e\ +ie 2 ) = 
is a highest weight vector with (J i, j 2 ) = (1/2, 1/2). Noting that 


i 

0 

W 


we conclude that 


dimC 4 = dim 



= 4 



Note that j\ + j 2 = 1/2 + 1/2 = 1 e Z, in accordance with (5.94). □ 


Before moving on to our next example, we need to discuss tensor products of 
s!(2,C)r irreps. The nice thing here is that we can use what we know about the 
tensor product of su(2) irreps to compute the decomposition of the tensor product 
of sl(2, C)r irreps. In fact, the following is true. 

Proposition 5.8 The decomposition into irreps of the tensor product of two 
s 1(2, Or irreps (ji, j 2 ) and (ki, k 2 ) is given by 

(jlj 2 )®(kl,k 2 ) = 0(12,12) 

where \j\ -ki\<h< j\ + k\, \j 2 - k 2 \ <l 2 < j 2 + k 2 (5.95) 

and each (li,l 2 ) consistent with the above inequalities occurs exactly once in the 
direct sum decomposition. 


Notice the restrictions on /i and l 2 , which correspond to the decomposition of 
tensor products of su(2) representations. We relegate a proof of this formula to the 


5. 10 The Irreducible Representations of s 1(2, Or, SL(2 , C) and 50(3, 1), 


203 


Appendix, but it should seem plausible. We have seen that one can roughly think 
of an sl(2, C)r representations as a ‘product’ of two su( 2 ) representations, and so 
the tensor product of two sl(2, (C)r representations (which can both be ‘factored’ 
into su( 2 ) representations) should just be given by the various ‘products’ of su( 2 ) 
representations that occur when taking the tensor product of the factors. We will 
apply this formula and make this concrete in the next example. 

Example 5.39 A 2 R 4 : The antisymmetric tensor representation of SO( 3, 1)„ 

This is an important example since the electromagnetic field tensor F^ v lives in this 
representation. To classify this representation, we first note that A 2 R 4 occurs in the 
<9(3, l)-invariant decomposition 

R 4 ®R 4 = R/ 7 “ 1 ® A 2 (M 4 )® 5 2 '(R 4 ) (5.96) 

where 


>7 1 = 77 ^% ® e v 

is the inverse of the Minkowski metric and 

S 2 ' (R 4 ) = {fe S 2 (R 4 ) | = 0} (5.97) 

is the set of symmetric ‘traceless’ second rank tensors, where the trace is effected 
by the Minkowski metric Note that this is just the <9(3, 1) analog of the 0(n) 
decomposition in Example 5.23. You should check that each of the subspaces in 
(5.96) really is <9(3, 1) invariant. Complexifying this yields 

c 4 ® c 4 = or 1 e a 2 (c 4 ) © s 2 '(c 4 ). 

Now, we can also decompose C 4 <g> C 4 using Proposition 5.8, which yields 

® Q , ^) = (0, °) © ( i, °) © (0, 1) © (1, 1). (5.98) 

Now, clearly Cr /~ 1 corresponds to (0, 0) since the former is a one-dimensional rep- 
resentation and (0, 0) is the only one-dimensional irrep in the decomposition (5.98). 
What about 5 2 (C 4 )? Well, it is straightforward to check using the results of the 
previous example that 

uo.o = (e\ + ief) <g> (e\ + ie 2 ) e ^(C 4 ) 

is a highest weight vector with {j\, jf) = ( 1 , 1 ) (it is also instructive to verify that 
i>o,o actually satisfies the condition in (5.97)). Checking dimensions then tells us 
that S 2 ' ( C 4 ) ~ ( 1 , 1 ), so we conclude that 

A 2 (C 4 ) ~(1,0)©(0, 1). 

This representation is decomposable, but remember that this does not imply that 
A 2 (R 4 ) is decomposable! In fact, A 2 (R 4 ) is irreducible. This is an unavoidable 
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subtlety of the relationship between complex representations and real representa- 
tions. 16 For an interpretation of the representations (1,0) and (0,1) individually, 
see Problem 5.12. 


5.11 Irreducibility and the Representations of 0(3, 1) and Its 
Double Covers 

In this section we will examine the constraints that parity and time-reversal place on 
representations of 0(3, 1) and its double covers. In particular, we will clarify our 
discussion of the Dirac spinor from Example 5.26 and explain why such so(3, 1) 
decomposable representations seem to occur so naturally. 

To start, consider the adjoint representation (Ad, so(3, 1)) of 0(3, 1). It is easily 
checked that the parity operator acts as 

Ad P (L i ) = L i 
Ad P (K i ) = -K i . 

Now say that we have a double cover of 0(3, 1), call it H (these certainly exist 
and are actually non-unique; see Sternberg [16] and the comments at the end of this 
section). H will have multiple components, just as 0(3, 1) does, and the component 
containing the identity will be isomorphic to SL( 2, C). 17 Since H is a double cover, 
there exists a two-to-one group homomorphism O : H — »■ 0(3, 1), which induces 
the usual Lie algebra isomorphism cp : sl(2, C)r — > so(3, 1). Now let P e H cover 
P e 0(3, 1), so that <P(P) = P . Then from the identity 

<t>(Ad h (X))=Ad m) (cl>(X)) Vh e H, X e sl(2, C) R (5.99) 

which you will prove below, we have 

( /,(Adp(Si))=Adp(L i ) 

= U 

as well as 

0(Ad p (Ki)) = -Ki. 

Since (p is an isomorphism we conclude that 

Ad p (S i ) = PS i p- 1 = S i 
Ad p(K i ) = PK i p- 1 =-K i . 

If (II, V ) is a representation of H and (jr, V ) the induced s[(2, C)r representation, 
then this implies that 


16 For the whole story on this relationship, see Onischik [11]. 

17 This should seem plausible, but proving it rigorously would require homotopy theory and would 
take us too far afield. See Frankel [4] for a nice discussion of this topic. 
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YlpMiUZ 1 =N t 

UpN^Z 1 =M t . 


(5.100) 

(5.101) 


Let us examine some consequences of this. Let W C V be an irreducible subspace 
of (7T, V ) equivalent to (j| , . 12 ) , spanned by our usual basis of the form 


B = {vk u k 2 I 0 < < 2j\, 0<k 2 < 2 j 2 } . 


We then have 


i M z n p vo.o — i lip N z i>o,o = ji n p vo.o 
iN z n p vo,o — iHpM z vo,o — jiHpvo,o 
M + Ilpi)o,o = UpN+vo.o = 0 
A^+IlpUo.o = n pM + uo,o = 0 


and thus 11 pvo,o is a highest weight vector for (j 2 ,ji)! We have thus proven the 
following proposition: 

Proposition 5.9 Let H be a double cover of 0(3, 1) and (FI, V) a complex rep- 
resentation of H with induced sl(2, (C)r representation (re, V). If W C V is an 
irreducible subspace of (it, V) equivalent to (ji , j2) a nd j\ j 2 , then there exists 
another irreducible subspace W' of (n, V) equivalent to (j 2 , ji). 

It should be clear from the above that the operator FI(P) corresponding to parity 
takes us back and forth between W and W' . This means that even though W is an 
invariant subspace of the Lie algebra representation (it, V ), W is not invariant under 
the Lie group representation (11, V), since V\(P) takes vectors in W to vectors in 
W'\ If W and W make up all of V . i.e. if V = W ® IT', this means that V is irre- 
ducible under the H representation n but not under the 50(3, 1) representation n, 
and so we have an irreducible Lie group representation whose induced Lie algebra 
representation is not irreducible! We will meet two examples of this type of repre- 
sentation below. Note that this does not contradict Proposition 5.3, as the group H 
does not satisfy the required hypotheses of connectedness. 

Exercise 5.41 Let $ : H — > G be a Lie group homomorphism with induced Lie algebra 

homomorphism </> : (5 — > g. Use the definition of the adjoint mapping and of cf> to show that 

0 (Ad;, (X)) = Ad$ ( /!) (</> (X)) V/t e H, X e f). 

Exercise 5.42 Verify (5. 100) and (5.101). You will need the result of the previous exercise! 

Example 5.40 The Dirac spinor revisited 

As a particular application of Proposition 5.9, suppose our representation (11, V) 
of H contains a subspace equivalent to the left-handed spinor (j, 0); then it must 
also contain a subspace equivalent to the right-handed spinor (0, j). This is why the 
Dirac spinor is (|, 0) © (0, |). It is not irreducible as an SL( 2, C) representation, 
but it is irreducible as a representation of a group H which extends SL( 2, C) and 
covers <9(3, 1). □ 
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The Dirac spinor representation is a representation of //. not 0(3. 1), but many 
of the other sl(2, C)r ~ so(3, 1) representations of interest do come from 0(3, 1) 
representations (for instance, the fundamental representation of so (3, 1) and its vari- 
ous tensor products). Since any 0(3, 1) representation n : 0(3, 1) — > GL(V) yields 
an H representation n o <J> : H GL(V), Proposition 5.9 as well as the comments 
following it hold for 0(3, 1) representations as well as representations of 77. In the 
0(3, 1) case, though, we can do even better: 

Proposition 5.10 Let (IT, V) be a finite-dimensional complex irreducible represen- 
tation of 0(3, 1). Then the induced S0(3, 1) representation (tv, V ) is equivalent to 
one of the following: 

Cl, j), 2; eZ or Cji, j2) 0 (j2,jl), 2p , 2ji £ Z, j\ + j 2 £ Z, j\ j 2 . 

(5.102) 

Proof Since so(3, 1) is semi-simple, (tt, V) is completely reducible, i.e. equivalent 
to a direct sum of irreducible representations. Let W C V be one such representa- 
tion, equivalent to (ji, j 2 ), with highest weight vector i>o,o- Then Id ( Pji.'o.o = vp is 
a highest weight vector with eigenvalues (j 2 , j t), and the same arguments show that 
n(7>o,0 = V T is also a highest weight vector with eigenvalues (j 2 , j 1 ) (it is not 
necessarily equal to up, though). The same arguments also show that n(PF)i>o,o = 
vpp is a highest weight vector with eigenvalues ( j \ , jf) (recall that T is the time- 
reversal operator defined in (4.34)). Now consider the vectors 

uto.o = fo.o + Vpp 
«o,o = Vp + Vp. 

w 0,0 is clearly a highest weight vector with eigenvalues (j 1 , jf), and no.o is clearly 
a highest weight vector with eigenvalues ( 72 , j 1 ). Using the fact that P and T com- 
mute, you can easily check that 

n(P)u;o,o = n(P)w 0 ,o = M o,o 

n(P)M 0 ,o = n(P)M 0 ,o = w 0 ,o- 
We can then define basis vectors 

WkiM = ( M-) kl (N-) k2 w 0 ,o , 0 < k\ < 2j \ , 0 < k 2 < 2 j 2 
UkiM — (M-) kl (N-) k2 u 0 0 , 0 < k\ < 2j 2 , 0 <k 2 < 2j\ 

which span so(3, 1) irreducible subspaces W — (ji, j2) an d U — (j 2 , ji). By Propo- 
sition 5.3 and the connectedness of 50(3, 1) G , W and U are also irreducible under 
50(3, 1) 0 , and from (5.94) we know that j\ + j 2 e Z. Furthermore, by the definition 
of the w , and u k lt k 2 ’ as we d as (5. 100)— (5. 101), we have 


n(P )^ 1? ^ 2 = n (T)w ku k 2 = u k 2 ,ki 

n(P)u kl ,k 2 = n(T)Uki,k2 = ^k 2 ,k\ 
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and so Spanlin^ t k 2 , is invariant under 0(3, 1). We assumed V was irre- 

ducible, though, so we conclude that V = Span {wk l ,k 2 ,Uh,h}- Does that mean 
we can conclude that V — (ji, h) ® (J2 ^ jl) ? Not quite, because we never estab- 
lished that w o,o and uo,o were linearly independent! In fact, they might be linearly 
de pendent, in which case they would be proportional, which would imply that each 
wki,k 2 is proportional to Uk ly k 2 (why?), and also that j\ = y '2 = j ■ In this case, we 
obtain 


V = Span {w kuk2 , ui uh } = Spanfuttj,^} ~ (j, j) 

which is one of the alternatives mentioned in the statement of the proposition. If 
wo.o and mo,o are linearly independent, however, then so is the set {u>k u k 2 , m/,,/ 2 } 
and so 


V = Span {wk 1 ,k 2 ,u/ l j 2 j = W ® U ~ (jl, j 2 ) © (j2, jl) 

which is the other alternative. All that remains is to show that j\ ^ ji, which you 
will do in Exercise 5.43 below. This concludes the proof. □ 

Exercise 5.43 Assume that n>o,o and «o,o are linearly independent and that yj = j 2 . Use 
this to construct a non-trivial 0(3, l)-invariant subspace of V, contradicting the irreducibil- 
ity of V . Thus if V is irreducible and wo t o and «o,o are linearly independent, then j\ ^ j 2 
as desired. 

Example 5.41 0(3, 1): Representations revisited 

In this example we just point out that all the 0(3, 1) representations we have met 
have the form (5.102). Below is a table of some of these representations, along with 
their complexifications and the corresponding so(3, 1) representations. 


Name 

V 

Vc 

so(3, 1) rep 

Scalar (trivial) 

R 

€ 

( 0 , 0 ) 

Vector (fundamental) 

R 4 

C 4 

' 2 ’ 2 ' 

Antisymmetric Tensor (adjoint) 

A 2 R 4 

A 2 C 4 

( 1 , 0 ) ©( 0 , 1 ) 

Pseudovector 

a 3 r 4 

a 3 c 4 

' 2 ’ 2' 

Pseudoscalar 

a 4 r 4 

a 4 c 4 

( 0 , 0 ) 

Symmetric Traceless Tensor 

S 2 '(R 4 ) 

S 2 '(C 4 ) 

( 1 , 1 ) 


Note that the pseudovector and pseudoscalar representations yield the same 
50(3, 1) representations as the vector and scalar, respectively, as discussed in Ex- 
ample 5.18. Thus we have a pair of examples in which two equivalent Lie algebra 
representations come from two non-equivalent matrix Lie group representations! 
Again, this does not contradict Proposition 5.2 since 0(3, 1) is not connected. Note 
also that the only representation in the above table that decomposes into more than 
one so(3, 1) irrep is the antisymmetric tensor representation; see Problem 5.12 for 
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the action of the parity operator on this representation, and how it takes one back 
and forth between the so(3, 1) -irreducible subspaces ( 1 , 0 ) and ( 0 , 1 ). □ 

Before concluding this chapter we should talk a little bit about this mysterious 
group H which is supposed to be a double cover of <9(3, 1). It can be shown 18 
that there are exactly eight non-isomorphic double covers of <9(3, 1) (in contrast to 
the case of SO( 3) and SO( 3, 1)„ which have the unique connected double covers 
SU(2) and 5X(2,C)). Most of these double covers are somewhat obscure and do 
not really crop up in the physics literature, but two of them are quite natural and 
well-studied: these are the Pin groups Pin(3, 1) and Pin(l, 3) which appear in the 
study of Clifford Algebras. Clifford algebras are rich and beautiful objects, and 
lead naturally to double covers of all the orthogonal and Lorentz groups. In the 
four-dimensional Lorentzian case in particular, one encounters the Dirac gamma 
matrices and the Dirac spinor, as well as the Pin groups which act naturally on the 
Dirac spinor. For details on the construction of the Pin groups and their properties, 
see Gockeler and Schiicker [7]. 


5.12 Problems 

5.1 In this problem we will develop a coordinate-based proof of our claim from 
Example 5.22. 

(a) Let X = X'-'et ® e/ e A 2 R", and define 

Z = X ij L( ei ) 0 ej = X ij g ik e k ® ej e £(R") 

<P(X) = [Z] e M„(R). 

Find an expression for <j>(X) in terms of [Z] and use it to show that (pi X) e g. 

(b) Prove that Ad (R) o (p — (p o A 2 IT ( A 1 j by evaluating both sides on an arbitrary 
Z e A 2 R" and showing that the components of the matrices are equal. You will 
need the expansion of Z given above, as well as the coordinate form (5.43) 
of A 2 n. For simplicity of matrix computation, you may wish to abandon the 
Einstein Summation Convention here and write the components of R e G as 
Rjj, even though you are interpreting R as a linear operator. 

5.2 In this problem we will develop a coordinate-free proof that the fundamental 
representation of SL(2, C) is equivalent to its dual. This will also imply that the 
fundamental representation of SU( 2) is equivalent to its dual as well. 

(a) Consider the epsilon tensor in 7jj 2 (C 2 ), 

1 9 

e = e A e~. 

Using the definition (3.84) of the determinant, show that e is SL( 2, C) invariant, 
i.e. that 


18 See Sternberg [16]. 
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e(Ai>, Aw) = e(v, w ) Vv, w e C 2 , A e SL( 2, C). 


(b) Define a map 

L , : C 2 -* C 2 ’ 
v i->- e(u, •)• 

Using your result from (a), show that L € is an intertwiner, and also show that L e 
is one-to-one. Conclude that C 2 ~ C 2 as SL{ 2, C) irreps, and hence as SU{ 2) 
irreps as well. 

(c) Assuming the very plausible but slightly annoying to prove fact that 
(S 2, (C 2 ))* — S 2I (C 2 *), prove that every SU( 2) representation is equivalent to 
its dual. One might think that this is self-evident since taking the dual of an 
irrep does not change its dimension and for a given dimension there is only one 
5(7(2 ) irrep (up to equivalence), but this assumes that the dual of an irrep is 
itself an irrep. This is true, but needs to be proven, and is the subject of the next 
problem. 


5.3 Show that if (IT, V) is a finite-dimensional irreducible group or Lie algebra 
representation, then so is (n*, V*). Note that this together with Problem 2.4 also 
implies the converse, namely that if (11*, V*) is an irrep then so is (II, V ). Use this 
to again prove that every su(2) irrep is equivalent to its dual. 


5.4 In this problem we will prove in coordinates that the fundamental of su(2) is 
equivalent to its dual. Let </> : C 2 — ► C 2 be an intertwiner between these two repre- 
sentations, so that 

<poX = -X T ocp VX e su(2). 

If we represent (p by a matrix M and note that by linearity it suffices to check this 
equation for the basis { S'/ }/ = i 3 of su(2), this becomes 

MSiM~ { =-Sj , ( = 1,2,3. 

Find a matrix M that satisfies this equation. (Hint: Use the intertwiner from Prob- 
lem 5.2.) 


5.5 If G is a matrix Lie group and (n, V ) its fundamental representation, one can 
sometimes generate new representations by considering the conjugate representa- 
tion 


fl : G ->■ GL(V) 

where A denotes the matrix whose entries are just the complex conjugates of the 
entries of A. 

(a) Verify that n is a homomorphism, hence a bona fide representation. Show that 
for G — SU(n), the conjugate representation is equivalent to the dual represen- 
tation. What does this mean in the case of 5(7(2) ? 
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(b) Let G — SL( 2, C). Show that the conjugate to (j, 0 ) is ( 0 , |). There a few ways 
to do this. This justifies the notation n for the representation homomorphism of 

( 0 , 

5.6 Let (it, V) be a representation of a Lie algebra g and assume that zr : g — > 
gl( V) is one-to-one (such Lie algebra representations are said to be faithful). Show 
that there is an invariant subspace of (jt ^ , V ® V*) that is equivalent to the adjoint 
representation (ad, g). 


5.7 In this problem we will meet a representation that is not completely reducible, 
(a) Consider the representation (II, R 2 ) of R given by 


n : R -> GL( 2, R 
a i-» 11(a) = 



Verify that (IT, R 2 ) is a representation. 

(b) If n was completely reducible then we would be able to decompose R 2 as R 2 = 
V © W where V and W are one-dimensional. Show that such a decomposition 
is impossible. 


5.8 In this problem we will show that Z 2 is semi-simple, i.e. that every finite- 
dimensional representation of Z 2 is completely reducible. Our strategy will be to 
construct a Z 2 -invariant inner product on our vector space, and then show that for 
any invariant subspace W, its orthogonal complement W 1 - is also invariant. We can 
then iterate this procedure to obtain a complete decomposition. 

(a) Let (FI, V) be a representation of Z 2 . We first construct a Z 2 -invariant inner 
product. To do this, we start with an arbitrary inner product (-,-)o on V (which 
could be defined, for instance, as one for which some arbitrary set of basis 
vectors is orthonormal). We then define a ‘group averaged’ inner product as 

(v|w) = ^2 ( Yl h v\n h w)a . 

h.Gz'ffj 2 

Show that (• | •) is Zz-invariant, i.e. that 

(n g u|n g w) = (u|w) Vn, w e V, g e Z 2 . 

(b) Assume now that there exists a non-trivial invariant subspace W C V (if no 
such W existed, then V would be irreducible, hence completely reducible and 
we would be done). Define its orthogonal complement 

W L = {u e V | (i>|w) = 0VweW}. 

Argue that there exists an orthonormal basis 

B = {uq, . . . , Wk, Vk+ 1 , . . . , v„} where Wj eff, i = 1 — k, 

and conclude that V = W © W 1 - . 
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(c) Show that W 1 - is an invariant subspace, so that V = W ® W L is a decomposi- 
tion into invariant subspaces. We still do not know that W is irreducible, though. 
Argue that we can nonetheless iterate our above argument until we obtain a de- 
composition into irreducibles, thus proving that (n, V ) is completely reducible. 

5.9 In this problem we will sketch a proof that the space of harmonic polynomials 

has dimension 2/ + 1. 

(a) Recall our notation P k (R 3 ) for the space of kth degree polynomials on M 3 . 
Assume that the map 

A : Pfc(R 3 ) — > P*;_ 2 (M 3 ) (5.103) 

is onto. Use the rank-nullity theorem to show that dim 7f/(M 3 ) = 21 + 1. If you 
need help you might consult Example 3.24, as well as (3.104). 

(b) To complete the proof we need to show that (5.103) is onto. I have not seen a 
clean or particularly enlightening proof of this fact, so I do not heartily recom- 
mend this part of the problem, but if you really want to show it you might try 
showing (inductively on k ) that the two-dimensional Laplacian 

A : P k (R 2 ) — ► A_ 2 (K 2 ) 

is onto, and then use this to show (inductively on k ) that (5.103) is onto as well. 

5.10 Let T and U be commuting linear operators on a complex vector space V, so 
that [T, U] = 0. We will show that T and U have a simultaneous eigenvector. 

Using the standard argument we employed in the proof of Proposition 5.4, show 
that T has at least one eigenvector. Denote that vector by v a and its eigenvalue by a. 
Let V a denote the span of all eigenvectors of T with eigenvalue a; V a may just be 
the one-dimensional subspace spanned by v a , or it may be bigger if there are other 
eigenvectors that also have eigenvalue a. Use the fact that U commutes with T to 
show that V„ is invariant under U (i.e. IJ (v) e V a whenever v e V a )■ We can then 
restrict U to V a to get a linear operator 

U\ Va = U a eC(V a ). (5.104) 

Then use the standard argument again to show that U a has an eigenvector vq € V a . 
Show that this is a simultaneous eigenvector of T and U . 

5.11 In this problem we would like to show that the tensor product ‘distributes’ 
over direct sums, in the sense that for any vector spaces V, W and Z, there exists a 
vector space isomorphism 

<p : (V © W) <g> Z (V <g> Z) © (W <g> Z). 

Define such a map on decomposable elements by 

(p : (u, w) 0 z i-* (v <E> z, w 0 z) 

and extend linearly. Show that </> is linear (you will have to be careful about how 
addition works in the direct sum spaces and the tensor product spaces) as well as 
one-to-one and onto, so that it is a vector space isomorphism. 
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5.12 In this problem we will study the <9(3, 1) representation (A 2 n, A 2 (C 4 ) and 
its decomposition into ( 1 , 0 ) © ( 0 , 1 ) under 5(9(3 , 1) 0 and so(3, 1). 

(a) Define the star operator * on A 2 C 4 by 

* : A 2 C 4 -* A 2 C 4 

et A ej i-> eijkir) km r\ ln e m A e n (5.105) 

and extending linearly. Here r) ln are the components of the inverse of the 
Minkowski metric and €ijki are the components of the usual epsilon tensor 
on R 4 . Show that * is an intertwiner between A 2 C 4 and itself when viewed 
as an 5(9(3, 1) 0 representation, but not as an (9(3, 1) representation. You will 
need (4.15) as well as your results from part (a) of Problem 3.4. 

(b) Compute the action of * on the basis vectors /;, i = 1-6 defined in Exam- 
ple 5.24. Use the /; to construct eigenvectors of *, and show that * is diago- 
nalizable with eigenvalues ±L so that A 2 C 4 decomposes into V+i © V-i where 
V+i is the eigenspace of * in which every vector is an eigenvector with eigen- 
value +i and likewise for V_,-. Then use Schur’s lemma to conclude that any 
5(9(3, 1) 0 -irreducible subspace must lie entirely in V+i or VA, . In particular, this 
means that A 2 C 4 is not irreducible as an 5(9(3, 1) 0 or so(3, 1) representation. 

(c) A convenient basis for V+i which you may have discovered above is 

v+i = /l + '74 
v +2 = fi + ife 
U+3 = h + if 6- 

Likewise, for V-i we have the basis 


v-i = fi - iU 
V-2 = fl~ ifs 
v-3 = /$- ife- 

Now consider the vectors 


ko.o^k+i + iv + 2 


w 0 ,o = v~i +iv - 2 . 


Show that these are highest weight vectors for ( 0 , 1 ) and ( 1 , 0 ), respectively. 
Then count dimensions to show that as so(3. 1) representations, V+i — (0, 1) 
and V-i — ( 1 , 0 ). 

(d) Show directly that A 2 Il(P)(v + j) = v-j, j = 1, 2, 3 and likewise for T, so that 
P and T interchange V+i and V-i, as expected. 


Chapter 6 

The Wigner-Eckart Theorem and Other 
Applications 


In this chapter we will apply what we have learned about representations to quantum 
mechanics, discussing selection rules and in particular the notoriously confusing 
subjects of spherical tensors and the Wigner-Eckart theorem. We will then finish 
our discussion with a brief foray into the beautiful subject of Clifford algebras, in 
an attempt to better understand the Dirac spinor. 


6.1 Tensor Operators, Spherical Tensors and Representation 
Operators 


In this section we introduce the notion of a representation operator, which general- 
izes the notions of vector and tensor operators as well as spherical tensors. We will 
then use representation operators to derive the fundamental quantum mechanical se- 
lection rule which lays the foundation for the various selection rules one encounters 
in standard quantum mechanics courses. 

You will recall from Sect. 3.7 that we defined a vector operator B = { l>, };=i , 2,3 
to be a set of three linear operators that transform as vectors under the adjoint action 
of the total angular momentum operators Jj. With the machinery we have devel- 
oped, we can now recast this definition a bit. More precisely, we can now say that 
if we have an SO(3) representation (II, Ti) on some Hilbert space 'H, then a vector 
operator on H. is a set B = { /!/}/= 1 . 2. 3 of three operators that satisfy 


3 

ni^BjYKR)- 1 =J2 R kiB k Vi = 1,2, 3, ReSO( 3). 

k= 1 


( 6 . 1 ) 


Notice that Tl(R)Bj n(B) = Id j ( A 1 ) B , , so (6.1) just says that under the induced 
representation IlJ of SO( 3) on CCH), the B, transform like the basis vectors e; of 
the fundamental (vector) representation of SO( 3). This is still roughly how vector 
operators are defined in most physics texts. We can go further, however, and inter- 
pret (6.1) as saying that SpanjB,}, considered as a vector subspace of C(H), is an 
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.S’0(3Hn variant subspace (under the representation n[) that is actually equivalent 
to the vector representation of SO( 3) ! This is an important shift in perspective, as we 
are now thinking of the space of operators as a representation space, and of the 
B, as spanning an invariant subspace of this space. This point of view has some 
advantages, and is the one that will be formalized shortly in the notion of a repre- 
sentation operator. Now, to verify that the definition (6. 1 ) is equivalent to our earlier 
definition of a vector operator from Chap. 3, you should check that if we perform 
our usual trick and set R = e ,L i and differentiate at t = 0, as well as identify in (Li) 
with what we have called the total angular momentum operator J j , we obtain 

3 

\Jj , Bj] = i ^2 e i jkBk (6.2) 

k=\ 

which is just (3.70). 

Now, in physics texts one usually defines tensor operators similarly, i.e. as a set 

(r) ■ l 

of operators = I'/ '' } that under the representation n ] of SO( 3) trans- 

form like the basis vectors e n ® • • • <g> e' r <g> ej l (g> ■ ■ ■ ® ey s of the space 77(R 3 ). 
Again, this can be alternately formulated as saying that Span {T n ' lr j l ...j s } c cm) 
is an invariant subspace equivalent to the Tf (R 3 ) representation of SO(3). We have 
seen, though, that tensor product spaces such as T/ (R 3 ) are usually decomposable, 
so it is sometimes advantageous to consider spherical tensors of degree /, which 
are sets of 21 + 1 operators T-6 = {7) (/, }o<£<2/ that under nj transform as the basis 
vectors Vk of the irreducible representation space V/./eZ. 1 Again, we will find it 
useful to think about Span {7^ ) as an invariant subspace of £(77) equivalent to V/, 
rather than focusing on how the individual elements transform. Note that the 
7’ in the superscript of 7j (/i is in parentheses; this is because the / is not really an 
active index, but just serves to remind us which SO( 3) representation we are dealing 
with. (As an additional side note, the term ‘spherical tensor’ derives from the fact 
that the representation V - / , / e Z is equivalent to the space 77/ of spherical harmonics 
of degree / .) 

The following example will help make the foregoing concrete. 

Example 6.1 Product of vector operators 

Consider two vector operators A = {A/}/= 1 , 2,3 and B = ! /7 / }/= 1 . 2 . 3 on some Hilbert 
space 77 with SO( 3) representation n. We can form a tensor operator (of type (0, 2)) 
by considering the set of their products AB = {A/By}, i, j = 1, 2, 3. Each A/By is 
a linear operator on 77, and it is easy to verify that AB is a bona fide (0, 2) tensor 
operator: 


'Warning: In other texts, different normalization and indexing conventions are used for the specific 
basis vectors whose transformation properties the spherical tensors are supposed to mimic. In most 
quantum mechanics texts, for instance, the components of a spherical tensor are supposed to mimic 
the transformation properties of the kets | l,m) which have different labeling and normalization 
conventions than our v £ , even though they are essentially the same thing. 
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nm-fijn®- 1 = [nwniiir^nfflSjnm- 1 ] 

3 

= J2 R kiRijA k B, (6.3) 

k,l = 1 

which is the same as the usual transformation law for the basis vectors {e,- ® e ■•} of 
7^°(]R 3 ). Thus, the nine-dimensional space SpanjA/fiy} is equivalent to X,°(R 3 ) as 
an SO( 3) representation! We know, however, that %° (R 3 ) is reducible, and so then 
(n{, Span{A,fiy}) must be reducible as well. We can decompose Spanj A,- Z?y} into 
irreducible subspaces just as we decomposed (R 3 ) = Kg © S“(K 3 ) © A 2 (R 3 ), 
and we should correspondingly obtain spherical tensors of degrees 0, 1 and 2. Ex- 
plicitly, the degree 0 subspace corresponding to g = J^ 3 =1 e, <g> e, yields a spherical 
tensor T <0) with single element 

To (0) =E^ Bi 

i= 1 

which transforms as a scalar. The degree 1 spherical tensor T (1) corresponding to 
A 2 R 3 has elements 2 

T 0 U) = (A x B) v + i (A x B) v 

r x (1) = — 2(A x B)- 

r 2 U) = -2[(A x B) A - i(A X B) v ] 

where notation like (A x B), is just shorthand for A y B~ — A Z B V . The appearance 
of the cross product here shouldn’t be surprising, since the cross product is essen- 
tially just a way of identifying a bivector (or antisymmetric tensor) in A 2 ]R 3 with a 
vector in R 3 , cf. Example 3.29. The degree 2 spherical tensor T (2) corresponding to 
S 2 (R 3 ) has elements 

70® = (A, + iA y )(B x + iB y ) 

T[ 2) = —2A Z (B X + iB y ) - 2 (A x + iA y )B- 
T 2 (2) = -4 (A X B X + Ay By - 2 A Z B Z ) 

T 3 (2) = 12 A Z (B X + iB y ) + 12 (A x + iA y )B- 
T 4 (2) = 24(A a - iA y )(B x - iBy). 

To convince yourself that these operators really do transform as claimed, it is use- 
ful to compare the above expressions with the expressions for the v k for 'Hi (R 3 ) 
and 7f 2 (R 3 ) (you should have computed these earlier as the f k from Problem 2.2). 
You should also convince yourself that the span of {A, /!,} is equal to the span 


2 The funny looking normalization factors and index convention come from our normalization and 

indexing convention for the v^s, which, as noted above, differ from the conventions in the physics 
literature. 
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of T (0) , T (1) , and T (2> taken together, so that we have the decomposition of 
(n[, Span { A; Bj\) into irreducibles as 

SpanfA/fi/} = Span T (0) © SpanT (1) 0 SpanT (2 \ 

One might also call this a “decomposition of {A/B ; } into spherical tensors”. □ 


By now you may have noticed that the above definitions are somewhat unwieldy; 
they rely on transformation properties of specific operators, and seem to rely too 
heavily on those operators, since the span of those operators seems more significant 
than the operators themselves. To remedy this, we present the following definition, 
which subsumes all of the foregoing definitions and also presents things in a more 
basis-independent fashion: given a representation (Flo, Vo) of a group G on some 
auxiliary vector space Vo, as well as a unitary representation (n, Ti) of G on some 
Hilbert space 'H. we define a representation operator to simply be an intertwiner 
between Vq and £(7i), or in other words a linear map p : Vo — > C(H) satisfying 


p(n 0 (g)u) = n(g)p(r)n(g)-' Vg e G, v e V Q . 
Note that in terms of maps between Vq and C(H), this just says 


(6.4) 


P°n 0 (g) = n|(g)op VgeG. (6.5) 

What this definition is saying, roughly, is that we have a subspace p(V o) C C(H) 
which, even though it is composed of operators acting on H , actually transforms like 
the space Vo under similarity transformations by the operators n„. Note that, strictly 
speaking, the representation operator p is not itself an operator, but an intertwiner 
between representations. 

How does this definition subsume the previous ones? Let Vo have a basis {e,}; 
then since p is linear, it is completely determined by its action on this basis, and 
plugging a basis vector e, into (6.4) then yields 

n(g)p(e;)n(g)“ 1 = (n 0 (g)) ; / p(e i ). (6.6) 

If we set p(p{) = Bi , this becomes 

n(g)B i n(gr 1 = (n 0 (g))/B j . (6.7) 

This, of course, just says that under the representation n[, the B, transform like 
basis vectors of the representation (Hq, Vo), which was how we defined vector op- 
erators, tensor operators, and spherical tensors in the first place! In fact, if we take 
G = SO( 3) and Vo = R 3 , then (6.7) becomes (writing the matrices of the fundamen- 
tal representation with both indices down) 


3 

naoB.niR)-' = J2 R ji B j 

j = i 

which is of course just (6.1). To get tensor operators or spherical operators, we of 
course just have to take Vo = 7J (K 3 ) or Vo = 7Y/(R 3 ), respectively. 
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Fig. 6.1 Action of the orthogonal projection operator P : Tt — > W 2 on a vector v e H 


Before leaving this section, it is worth re-emphasizing the shift in viewpoint that 
we have made: we have taken a representation (J\ , ‘H) of G on a Hilbert space 
TL and then used the induced representation (nj, C('H)) to let G act on the space 
of operators C{V)\ this space (of operators!) may then break up into irreducible 
components, and knowing what irrep an operator lives in may actually tells us quite 
a bit about its action on vectors in TL. In particular we get selection rules and the 
Wigner-Eckart theorem, which are the subjects of the next section. 


6.2 Selection Rules and the Wigner-Eckart Theorem 

Now that we have defined representation operators, we can go on to formulate the 
fundamental selection rule from which the usual quantum mechanical selection rules 
can be derived. Then we can state and prove the Wigner-Eckart theorem, which is 
a kind of complement to the angular momentum selection rules. First, though, we 
need the following fact, the proof of which we only sketch. The details are deferred 
to the problems referenced below. 

Proposition 6.1 Let W 1 and W 2 be finite-dimensional inequivalent irreducible sub- 
spaces of a unitary representation {Yl.TL) equipped with an inner product (■ | ■)■ 
Then W 1 is orthogonal to Wj- 

Proof sketch Define the orthogonal projection operator P : TL — »■ HA to he the map 
which sends v e TL to the unique vector P(v) e Wi satisfying 

(P(u)|w) = (u| w) Viu e W 2 . (6.8) 

This is depicted schematically in Fig. 6.1. You will check in Problem 6.1 that such 
a vector P(v) exists and is in fact unique. If we now restrict P to W 1 we get 
P\W] ■ W 1 — > W 2 , and using the unitarity of n and the invariance of the W, one 
can show that P\w t is an intertwiner. One can then use Schur’s lemma to conclude 
that P\ Wl =0, which then implies that W 1 is orthogonal to MA. To fill out the details, 
see Problem 6.2. □ 
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With this in hand, we can now state and prove 

Proposition 6.2 (Selection rule) Let G be a semi-simple group, and let W \ and VIA 
be finite-dimensional inequivalent irreducible subspaces of a unitary representation 
(n, Ti) ofG. Also let p :V o — >■ C(Ti) be a representation operator, where (Flo, Vo) 
is some auxiliary representation. Then 

[w\\p(v)wf) = 0 Vu e Vo, wteWi (6.9) 

unless the decomposition of Vo ® W 2 into irreducibles contains a representation 
equivalent to W\ . 

Before proving this, let us comment on what this somewhat opaque proposition 
really means. Roughly speaking, the selection rule just says that a vector of the form 
p(v)w 2 is a kind of ‘product’ of elements of Vo and W 2 , and thus transforms like 
something in Vo <8> W 2 = U\ © • ■ • © Uk . Thus, for there to be any overlap with an 
irreducible subspace IV 1 , IV 1 must then be equivalent to one of the f/;. 


Proof of proposition Define a map by 


T : Vo (8) W 2 Ti 

V ® U>2 !->■ p{v)W2 


(6.10) 


and extending linearly to arbitrary elements of Vo ® W 2 . You can check that this is 
a linear map between vector spaces, and so the image T (Vo ® Wi) = T> C Ti is a 
vector subspace of Ti. Now, since G is semi-simple we can decompose Vo ® W 2 
into irreducibles as 


Vo <g> W 2 = Ui © • • • © U k 

for some irreps Uk- As you will show below, the fact that p is a representation 
operator implies that T is in fact an intertwiner, and this further implies that the 
kernel of T (cf. Exercise 4.17) is an invariant subspace of Vo ® W 2 . This means that 
(with a possible relabeling of the U,) we can write the kernel of T as U\ © • ■ ■ © U m 
for some m < k, which then implies that D is equivalent to U m + 1 © • • • © C4. If 
none of the Ui are equivalent to W \, then by Proposition 6.1, every vector in D is 
orthogonal to every vector in IVi, i.e. 

(u>i \p{v)w 2 ) — 0 Vu e Vo, WjeWj, 

which is what we wanted to prove. □ 

Exercise 6.1 Quickly show that p being a representation operator implies that T is an 
intertwiner. Show further that the kernel of T is an invariant subspace of Vo (g> W 2 ■ 

We will now use this generalized selection rule to reproduce some of the familiar 
selection rules from quantum mechanics. 
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Example 6.2 Parity selection rules 

Let (J\ ,'H) be a complex unitary representation of the two-element group T.i — 
{I, P} where P is the parity operator. Let v a . vp e PL be parity eigenstates with 
eigenvalues c a , cp. If c a = 1 then v a spans the one-dimensional trivial representa- 
tion of Ti 2 , and if c a — — 1 then v a spans the alternating representation. Likewise for 
cp. Now let Vo be another one-dimensional irrep of Z 2 with parity eigenvalue co, 
and let B e p(V 0 ), where p is a representation operator. Then a little thought shows 
that the selection rule implies 

(Vp\Bv a ) = 0 unless cp = coc«. 

Thus if is B is parity-odd (co = — 1) then it can only connect states of opposite parity, 
and if it is parity-even (co = +1) then it can only connect states of the same parity. 
If we were looking at dipolar radiative transitions (which emit a photon) between 
electronic states of an atom or molecule, the relevant operators are the components 
of the dipole operator p which is parity-odd (since it is a vector operator). The above 
then tells us that dipolar radiative transitions can only occur between electronic 
states of opposite parity. 

Example 6.3 Angular momentum selection rules 

Let (II, Pi) be a complex unitary representation of SU (2), with two subspaces W\ 
and Wj equivalent to V/ and Vp, respectively, 21,2 j eh. Also suppose we have 
an SU(2) representation operator A : V q — > C{PL), q eh. Then for any v e Wj, 
v' e Wj, A e A (V q ), the selection rule tells us that 

(i/|Ai>)=0 unless \l — q\ < j < Z + q. 

If we again consider a dipolar radiative transition between electronic states of an 
atom or molecule, the relevant operator is still the dipole p whose components pi 
live in p(V 1 ), and so we find that a dipolar radiative transition between states with 
angular momentum j and / can only occur if/ — 1 < y < / + 1 . □ 

The famous Wigner-Eckart theorem can be seen as a kind of complement to the 
angular momentum selection rule above. In the notation of the previous example, 
the Wigner-Eckart theorem says (roughly) that when (v r , Av) is not equal to zero, it 
is still tightly constrained and is in fact determined up to a constant by the fact that 
A is a representation operator. The precise statement is as follows: 

Proposition 6.3 (Wigner-Eckart) Let (II, Pi) be a complex unitary representation 
of SU (2), with two subspaces W\ and Wj equivalent to V; and V j, respectively, 
21, 2j eh. Also suppose we have two SU{ 2) representation operators A, B : Vq — > 
C{PL), q eh, which yield two spherical tensors with components Ap = A(vp), Bp = 
B(i)£), 0 < k <2q + 1. Finally, assume that 

(i/| Apv) ^ 0 for some k and v eW/, v'eWj. 


( 6 . 11 ) 
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Then for all k and w e Wj, w' e Wj, we have 

{w'\B k w) = c(u/| A k w) (6.12) 

for some constant c e C which is independent ofk, w and it/. 

Proof Let C(W/, W f) denote the vector space of all linear maps from W/ to Wj . By 
restricting to Wi and using the orthogonal projection operator If : Ti — >■ Wj , we can 
turn A : V q — >■ COT) into a map 

A : V q -* C(W h Wj) 
v i->- Pj o A(v)\w r 

All we have done here is take the linear operator A(v) e COT) and restrict it to 
W] and then project onto Wj. Now, since Wi, W / are SU(2) -invariant subspaces, 
the vector space CiWt, Wj) actually furnishes a representation (flj, C(Wi, W /)) of 
SU(2) by 

n!(g)r = n(g)rn(gr\ TeC(w,,Wj), gesu( 2 ). (6.13) 

You will check below that this is a bona fide representation, and is in fact equivalent 
to the tensor product representation Wf (g> Wj — W/ <S) Wj ! Furthermore, since the 
action of SU (2) on 'H commutes with restriction and projection (cf. Problem 6.2), it 
is not hard to see that A is an intertwiner. From (6.11) we know that A is not zero, 
and so from Schur’s lemma we conclude that C{Wj , Wj) — W/ <E) Wj has a subspace 
U q equivalent to V q , and that A is a vector space isomorphism from V q to U q . 

Now, we could also use our second representation operator B to construct a sec- 
ond intertwiner B : V q — > U q C C(Wi, Wj). Then we may invoke the corollary of 
Schur’s lemma that you proved in Exercise 5.33 to conclude that B = cA. But this 
then means that 


A(vfc) = cB(ufc) Wk 

=>■ (u/lAfi^jw) = c(w , |B(u J t)w) and weWi, w'eWj 
=>■ (w'\Akw) = c(w'\BkW ) Tk and w e Wi, w' e Wj 

and so we are done. In deducing the last line we used the definition of A, the def- 
inition of the orthogonal projection operator Pj, and the definitions A* = A(vk), 
B k = B(vk). □ 

Exercise 6.2 Show that if T e C(Wi, Wj) then so is n(g)7’TI(g) -1 , so that 
(flj, C(Wi, Wj)) really is a representation; in fact, it is an invariant subspace of (n[, C(H)). 

In analogy to the equivalence between V* ® V and C(V), show that (flj, C(Wi, Wj)) is 
equivalent to Wf ®Wj , which by Problem 5.3 is equivalent to Wi ® Wj . 

You may have noticed that this is not the way the Wigner-Eckart theorem is usu- 
ally stated in advanced quantum mechanics texts like Sakurai [14]. There, the theo- 
rem usually states that the matrix elements of a spherical tensor are proportional to 
the Clebsch-Gordan coefficients. To make this connection, consider the intertwiner 
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T-.V q ®W,-+ Wj 

V !->■ Pj(A(v)w). 

If we work with standard bases {v m } m= o- 2q , [w n } n=0 - 2 h {w' p } p = 0 -2j for V q , W h 
and Wj , then this map has components 

T mn p = T(v m , w n , w ,p ) 

= u> ,p (P J (A(v m )w n )) 

1 . , , 

— m 7~j7 \ w p I Pj (A m Wn)) 

II w p || 

1 . . . 

= .. . I, (w \A m w n ). (6.14) 

\\w' p \\ p 

Now let us switch to the orthonormal bases familiar from quantum mechanics, 
which look like 


\q,m q )eV q —q <m q < q 
|Z, mi) e Wi — l < mi < / 

\j, mjY e Wj — j < m j < j 

(notice the prime on the last set of vectors, which will distinguish it from vectors 
in V q ® Wi with the same quantum numbers). With this basis and notation, the 
components (6.14) of T become the matrix elements '{j, mj\A m \l, mi). 

What do these matrix elements have to do with Clebsch-Gordan coefficients? 
Recall that V q <S> IV/ has two convenient sets of orthonormal basis vectors: 

\q , m q ) ® \l, mi) = \ql\ m q , mi) e V q ® W / , —q<m q <q, —l<mi<l 

1 1', my) e V q ® W/, \l — q\ < l' < l + q , — l' < my < I' . 

The Clebsch-Gordan coefficients are just the inner products ( l',my\ql\ k,m{) of 
these basis vectors. Since T : V q ® Wi —*■ Wj is nonzero, V q ® W/ must contain a 
subspace U; equivalent to Wj, and so we can consider the orthogonal projection 
operator P : V q ® Wi — >■ U j . By Problem 6.1 this is given by 

P : V q ® W, — y Uj 

\ql;m q ,mi) \-> ^ (j, mj\ql\ m q , mi)\j , mj) 

and is an intertwiner by Problem 6.2. By then making the obvious identification of 
Uj with Wj and hence |y, mj) i-* | j, mj)', we get the intertwiner 

P ' : V q ® W, — y Wj 

\ql;m q ,mi) \-± ^ {j , mj\ql\ m q , m t )\j , nij)' 

whose components are nothing but the Clebsch-Gordan coefficients! By Wigner- 
Eckart, though, this intertwiner must be proportional to T, and so its components 
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must be proportional to those of T . We thus have the following component version 
of the Wigner-Eckart theorem: 

Proposition 6.4 (Wigner-Eckart II) Let ( V\ ,'H) be a complex unitary representa- 
tion ofSU{2), with two subspaces Wj and W ; equivalent to V; and V; , respectively, 
21, 2j e Z. Also suppose we have a degree q spherical tensor A = { A mq }. Then 


{j,mj\A mq \l,mi) =c{j,nij\ql\ m q ,m\) 


(6.15) 


where c is a constant independent of mi, m q , and m y. 


6.3 Gamma Matrices and Dirac Bilinears 

We conclude this short chapter with what is essentially an extended example, which 
involves both the representation operators we have met in this chapter as well as 
the 0(3, 1) representation theory we developed in the last. This section relies on a 
familiarity with the theory of the Dirac electron; if you have not seen this material, 
and in particular are unfamiliar with gamma matrices and Dirac bilinears, then this 
section can be skipped. 

Let (D = II © fl, C 4 ) be the Dirac spinor representation of SL{2, C), so that 


D : SL( 2, C) -* GL{ 4, C) 



We claim that the gamma matrices y /( can be seen as the components of a represen- 
tation operator y : R 4 — >■ £(C 4 ) = where R 4 is the Minkowski four-vector 

representation of SL( 2, C). To define y , we need the following two identifications 
of R 4 with // 2 (C), where X = (x, t) e R 4 and cr = (cr x , o y , of): 


R 4 H 2 (C) 

X •<->■ X* = x ■ 0 1 1 

XoX*=x-a—tI. 


(6.16) 

(6.17) 


Note that (6.16) is just the identification we used in Examples 4.20 and 5.5 in defin- 
ing the four-vector representation of SL( 2, C), and (6.17) is just a slight variation of 
that. Now, using the well-known property of the sigma matrices that 


OjO j T o jOj — 28jj , 


(6.18) 


you will verify below that 


X* = r)(X,X)Xf l . 


(6.19) 


Also, if p : SL(2, C) —*■ SO( 3, 1) 0 is the homomorphism from Example 4.20 which 
defines the four- vector representation, then by the definition of p we have 
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(p(A)X)^ = AZ*A t , 
which when combined with (6.19) yields 


(p(A)X)* = A t_1 X*A _1 

as you will also show below. If we then define a map y by 


Y 


M 4 ->• 
Xh- 


M 4 ( C) 

(». 



(6.20) 

( 6 . 21 ) 


you can then use (6.20) and (6.21) to check that y is a representation operator, i.e. 
that 


y(p(A)X) = D{A)y{X)D(A)- { . (6.22) 

If we define the gamma matrices as y IL = yie^ ). then we find that 


Yi = 



Y4 = 




i = 1,2,3 


V5 = K1/2K3K4 = 




Up to a few minus signs that have to do with our choice of signature for the 
Minkowski metric, as well as the fact that most texts write yo instead of y 4 , this 
is the familiar chiral representation of the gamma matrices (the chiral representa- 
tion is the one in which is diagonal). If we let A M |: be the components of p(A), 
we can take X — in (6.22) to get 


A /y v = D{A) Yll D{A)- { 


(6.23) 


which is how (6.22) usually appears in physics texts. Another important property is 
that the gamma matrices satisfy the fundamental anticommutation relation 


V ji Yv Yv Y/i — 2r/y ; u 

which you can easily verify. 


(6.24) 


Exercise 6.3 Verify (6.19), (6.21), (6.22), and (6.24). 

Now we would like to construct the Dirac bilinears. Denote an element of C 4 by 


*-{% 

where i f/L and if/ r are two-component spinors living in (j, 0) and (0, j), respec- 
tively. Also define the row vector i Jr by 
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where the V denotes complex conjugation (note that the positions of the right and 
left-handed spinors are switched here). You can easily check that if i jr transforms 
like a Dirac spinor, i.e. x/r m- l)( A) iJ/, then i jr transforms like 

) = ^D(A)-‘. (6.25) 

If we consider the associated column vector xj/ T , it transforms like 



which you should recognize as the representation dual to the Dirac spinor, since 
the matrices of dual representations are just the inverse transpose of the original 
matrices. However, we know from Problem 5.2 that (j, 0 ) and ( 0 , |) are equivalent 
to their duals, so we conclude that i Jr transforms like a Dirac spinor as well. 

With ij/ in hand, we can now define the Dirac bilinears 

• i/n/r scalar 

• vector 

• (A Y/i Yv M 7 ^ v antisymmetric 2-tensor 

• 4 r YnYvYpi r ’ M 7 ^ v 7 ^ P pseudovector 

• lA P'S 1 A pseudoscalar 

Each of these contains a product of two Dirac spinors, and can be seen as compo- 
nents of tensors living in C 4 <g> C 4 . Note the similarities between the names of the 
Dirac bilinears and the first five entries of the table from Example 5.41. This makes 
it seem like the Dirac bilinears should transform like the components of antisym- 
metric tensors (of ranks 0 through 4). Is this true? Well, using (6.25), we find that 
the scalar transforms like 

i Jr i — >- \[r D (A)~ 1 D — x[rxlr 

and so really does transform like a scalar. Similarly, using (6.23), we find that the 
vector transforms like 

i'Ypi' 1 — * fD(A)~ { YpD{A)^j/ 

— JA~ l p V 'lfYvf 

and so really does transform like a vector. Using the anticommutation relation 
(6.24), you can similarly verify that the antisymmetric 2-tensor, pseudovector and 
pseudoscalar transform like antisymmetric tensors of ranks 2, 3, and 4, respectively. 
If we let A*(R 4 ) denote the set of all antisymmetric tensor products of R 4 , i.e. 

4 

a¥e0a 1 i 4 , 

k= 0 

and let A* R 4 denote its complexification, then C 4 ® C 4 contains a subspace equiv- 
alent to A* R 4 . However, one can check that both spaces have (complex) dimension 
16, so as 51(2, C)r representations 
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C 4 <g> C 4 ~ A*K^, 


(6.26) 


i.e. the tensor product of the Dirac spinor representation with itself is equiva- 
lent to the space of all antisymmetric tensors! 

Another way to obtain this same result is to use our tensor product decomposition 
(5.95). The tensor product C 4 <S) C 4 of two Dirac spinors is given by 




= (1,0)® (0,0)© Q, ^ © Q, ^ © (0, 1) © (0,0) 

® (O’ 0) ( 6 - 27) 

where in the first equality we used the fact that the tensor product distributes over 
direct sums (see Problem 5.11), in the second equality we used (5.95), and in the 
third equality we just rearranged the summands. However, from Example 5.41 we 
know that as an so(3, 1) representation, A*Rjj( decomposes as 

1 1\ 

2’2j 0( °’ O) 

which is just (6.27)! 

Exercise 6.4 Verify (6.25). Also verify that the antisymmetric 2-tensor, pseudovector and 
pseudoscalar bilinears transform like the components of antisymmetric tensors of rank 2, 3 
and 4. 


A* 


— (0, 0) < 


W )©(1, 0)®(0, D© 


= ( 0 , 0 ). 


^ 10 ( 1 , 0 ) 0 ( 0 , 1)0 


6.4 Problems 

6.1 In this problem we will establish a couple of the basic properties of the orthog- 
onal projection operator. To this end, let H be a Hilbert space with inner product 
(• | •) and let W be a finite-dimensional subspace of TL. 

(a) Show that for any v e Pi, there exists a unique vector P(v) eW such that 

(P(tO|u/) = (d|u/) Vut'eW. 

This defines the orthogonal projection map P : H —*■ W which projects 7 j 
onto W. (Hint: there are a few ways to show that P(v) exists and is unique. 
One route is to consider the map L :H —> H* given by L(v) — ( r | ■ ) and then 
play around with restrictions to W .) 

(b) Quickly show that P(w ) = w for all w e W, and hence that P 2 = P. 
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(c) Let { e / } be a (possibly infinite) orthonormal basis for Pi where the first k vectors 
e,-, i = 1, . . . , k are a basis for W. Show that if we expand an arbitrary v ePi 
as v — v' e; where the implied sum is over all i (and where the sum may be 
infinite), then P takes the simple form 

k 

P(v ) = P(v'ei) — yVe,-. 

i=t 

Thus P can be thought of as ‘projecting out all the components orthogonal 
to W’. 

(d) Let {e ji} be an arbitrary ON basis for Pi. Show that in this case the action of P 
is given by 

k 

P(ey) = 

i=i 

6.2 Let (FI, Pi) be a unitary representation of a group G on a Hilbert space Pi. In 
this problem we will show that inequivalent irreducible subspaces of Pi are orthog- 
onal. 

(a) Let W C Pi be a finite-dimensional irreducible subspace and P : Pi -» W the 
orthogonal projection operator onto W. Use the defining property of P , the 
unitarity of n, and the invariance of W to show that P is an intertwiner. 

(b) Let V C Pi be another irreducible subspace inequivalent to W. Restrict P to V 
to get P\v : V — > W, which is still an intertwiner. Use Schur’s lemma to deduce 
that P\y =0, and conclude that V and W are orthogonal. 



Appendix 

Complexifications of Real Lie Algebras 
and the Tensor Product Decomposition 
of 51(2 , C)r Representations 


The goal of this appendix is to prove Proposition 5.8 about the tensor product 
decomposition of two sl(2, C)r representations. The proof is long but will intro- 
duce some useful notions, like the direct sum and complexification of a Lie al- 
gebra. We will use these notions to show that the representations of sl(2, C)r 
are in 1-1 correspondence with certain representations of the complex Lie alge- 
bra s 1(2, C) © sl(2, C), and the fact that this complex Lie algebra is a direct sum 
will imply certain properties about its representations, which in turn will allow us to 
prove Proposition 5.8. 


A.l Direct Sums and Complexifications of Lie Algebras 

In this text we have dealt only with real Lie algebras, as that is the case of greatest 
interest for physicists. From a more mathematical point of view, however, it actually 
simplifies matters to focus on the complex case, and we will need that approach 
to prove Proposition 5.8. With that in mind, we make the following definition (in 
total analogy to the real case): A complex Lie algebra is a complex vector space 
0 equipped with a complex-linear Lie bracket [•,•]: g x g —*■ g which satisfies the 
usual axioms of antisymmetry 

[X,Y] = -[Y,X] VX, Teg, 

and the Jacobi identity 

[[X, Y], Z] + [[Y, Z], X] + [[Z, X], Y] = 0 VX, Y, Z e g. 

Examples of complex Lie algebras are M n (C ) = g!(n,C), the set of all complex 
n x n matrices, and si (n, C), the set of all complex, traceless n x n matrices. In both 
cases the bracket is just given by the commutator of matrices. 

For our application we will be interested in turning real Lie algebras into complex 
Lie algebras. We already know how to complexify vector spaces, so to turn a real Lie 
algebra into a complex one we just have to extend the Lie bracket to the complexified 
vector space. This is done in the obvious way: given a real Lie algebra g with bracket 
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we define its complexification to be the complexified vector space Qr = C ® g 
with Lie bracket t-.-]c defined by 


[1 0 Xi + i ® A 2 , 1 ® Yi + i ® Y 2 ] c 
= 1 ® [X u Y x ] - 1 ® [X 2 , Y 2 ] + i ® [X U Y 2 ] + i ® [ X 2 , Y^ 

where A, . Y, e g, i = 1,2. If we abbreviate i ® X as iX and 1 ® X as A, this tidies 
up and becomes 

[A! + iX 2 , Yi + iY 2 ] c = [A x , Yi] - [X 2 , Y 2 ] + /([A!, Y 2 ] + [A 2 , Li]). 

This formula defines [-,-]c in terms of and is also exactly what you would get 
by naively using complex-linearity to expand the left hand side. 

What does this process yield in familiar cases? For su(2) we define a map 


0:s u(2) c — >sl(2,C) 

1 ® Ai + i ® X 2 i — > Xi + ^ ^ 

You will show below that this is a Lie algebra isomorphism, and hence su(2) 
complexifies to become sl(2,C). You will also use similar maps to show that 
u(«)c — 0 [(h, R)c — fll( n, C). If we complexify the real Lie algebra sl(2, C)r, we 
also get something nice. The complexified Lie algebra (sl(2, C)r)c has complex 
basis 


A 2 , Ai, A 2 esu(2). 


(A.l) 


Mi = -(1 ® S, — i ® Ki), i = 1,2,3 

A fi = -(1 ® Si + i <g> Ki), i = 1,2,3 

which we can again abbreviate 1 as 


Mi= l -{Si-iKi), i= 1,2,3 
Ni = '-(Si + i Kf), i= 1,2,3. 


(A. 2) 


These expressions are very similar to ones found in our discussion of sl(2, C)r rep- 
resentations, and using the bracket on (5 1(2, C)r)c one can verify that the analogs 
of (5.83) hold, i.e. that 


[MiMj] C = 0 

3 

[Mi , Mj]c = ^2 ZijkMk 

k= 1 

3 

[A/i , A/)]c = ^2 CijkNk- 

k=\ 


(A. 3) 


1 Careful here ! When we write i K [ , this is not to be interpreted as i times the matrix K , , as this 
would make the A f, identically zero (check!); it is merely shorthand for i ® A,- . 
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Notice that both Span {Ad, } and Span{A//} (over the complex numbers) are Lie sub- 
algebras of (sl(2, <C)r)c and that both are isomorphic to sl(2, C), since 


SpanfAd,} ~ Span{A/;} — su(2)c — sl(2, C). 

Furthermore, the bracket between an element of Span {Ad; } and an element of 
Span{A/;} is 0 , by (A. 3). Also, as a (complex) vector space (sl( 2 , C)r)c is the direct 
sum SpanjAd, 1 © Span {Ad }■ When a Lie algebra g can be written as a direct sum 
of subspaces W i and ML, where the VV; are each subalgebras and [up, W 2 ] = 0 for 
all w\ e Wi, u>2 e W2, we say that the original Lie algebra g is a Lie algebra direct 
sum of Wi and ML, and we write g = W 1 ® ML - 2 Thus, we have the Lie algebra 
direct sum decomposition 


(sl(2, C)r) c ~ sl(2, C) © sl(2, C). 

This decomposition will be crucial in our proof of Proposition 5.8. 


(A.4) 


Exercise A.l Prove that (A. 1 ) is a Lie algebra isomorphism. Remember, this consists of 
showing that 0 is a vector space isomorphism, and then showing that (ft preserves brackets. 
Then find similar Lie algebra isomorphisms to prove that 


u(n)c — 0 l(n,C) 
0 l(n,R)c — gl(n.C). 


A.2 Representations of Complexified Lie Algebras and 
the Tensor Product Decomposition of si (2, C)r 
R epresentations 

In order for (A.4) to be of any use, we must know how the representations of a 
real Lie algebra relate to the representations of its complexification. First off, we 
should clarify that when we speak of a representation of a complex Lie algebra 
g we are ignoring the complex vector space structure of g; in particular, the Lie 
algebra homomorphism jt : g — > gl(F) is only required to be real-linear , in the 
sense that 7 t(cX) = ctt ( X ) for all real numbers c. If g is a complex Lie algebra, the 
representation space V is complex and tt(cX ) = ctt(X) for all complex numbers c, 
then we say that n is complex-linear. Not all complex representations of complex 
Lie algebras are complex-linear. For instance, all of the sI(2,C)r representations 
described in Sect. 5.10 can be thought of as representations of the complex Lie 
algebra sl(2, C), but only those of the form (j, 0) are complex-linear, as you will 
show below. 


2 Notice that this notations is ambiguous, since it could mean either that g is the direct sum of W 1 
and W 2 as vector spaces (which would then tell you nothing about how the direct sum decomposi- 
tion interacts with the Lie bracket), or it could mean Lie algebra direct sum. We will be explicit if 
there is any possibility of confusion. 
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Exercise A.2 Consider all the representations of sl(2, C)r as representations of s((2, C) as 
well. Show directly that the fundamental representation ( \ , 0) of s 1(2, C) is complex-linear, 
and use this to prove that all sl(2, C) representations of the form (j, 0) are complex-linear. 
Furthermore, by considering the operators A, defined in (5.82), show that these are the only 
complex-linear representations of sl(2, C). 

Now let g be a real Lie algebra and let (it, V ) be a complex representation of g. 
Then we can extend (: r, V ) to a complex-linear representation of the complexifica- 
tion gc in the obvious way, by setting 

t T(Xi+iX 2 ) = 7r(Xi) + ijr(X 2 ), X l ,X 2 eg. (A.5) 

(Notice that this representation is complex-linear by definition, and that the operator 
in(X 2 ) is only well-defined because V is a complex vector space.) Furthermore, 
this extension operation is reversible: that is, given a complex-linear representation 
(tt, V) of 0c, we can get a representation of g by simply restricting tc to the subspace 
{ 1 <g) X + / <g> 0 | X e g} ~ g, and this restriction reverses the extension just defined. 
Furthermore, you will show below that (tt, V ) is an irrep of g,f if and only if it 
corresponds to an irrep of g. We thus have 

Proposition A.l The irreducible complex representations of a real Lie algebra g 
are in one-to-one correspondence with the irreducible complex-linear representa- 
tions of its complexification gc. 

This means that we can identify the irreducible complex representations of g with 
the irreducible complex-linear representations of gc, and we will freely make this 
identification from now on. Note the contrast between what we are doing here and 
what we did in Sect. 5.9; there, we complexified real representations to get complex 
representations which we could then classify; here, the representation space is fixed 
(and is always complex!) and we are complexifying the Lie algebra itself, to get a 
representation of a complex Lie algebra on the same representation space we started 
with. 

Exercise A.3 Let (n, V) be a complex representation of a real Lie algebra g, and extend it 
to a complex-linear representation of gc- Show that ( tt , V) is irreducible as a representation 
of g if and only if it is irreducible as a representation of gc- 

Example A.l The complex-linear irreducible representations of s 1(2, C) 

As a first application of Proposition A.l, consider the complex Lie algebra sl(2, C). 
Since sl(2, C) ~ su(2)c, we conclude that its complex-linear irreps are just the ir- 
reps ( 7 tj , Vj ) of su(2) ! In fact, you can easily show directly that the complex-linear 
sl(2, C) representation corresponding to (7 Tj, Vj) is just (j, 0). □ 

As a second application, note that by (A. 4) the complex-linear irreps of 
51(2, C) ®sl(2,C) are just the representations (ji , j2) coming from sl(2, <C)r. Since 
sl(2, C) ® 51(2, C) is a direct sum, however, there is another way to construct 
complex-linear irreps. Take two complex-linear irreps of 51(2, C), say (7 T/ l , Vj ] ) 
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and ( jtj 2 , V j 2 ) . We can take a modified tensor product of these representations such 
that the resulting representation is not of sl(2, C) but rather of 51(2, C) © st(2, C). 
We denote this representation by (jtj l <g> 7tp, Vj l <g> Vj 2 ) and define it by 

(Ttp <g> 7T j2 )(X u X 2 ) = n jl (Xj) <g> I + I <g> 7tj 2 (X 2 ) e C(V\ <g> V 2 ), 

X U X 2 esl(2,C). (A. 6) 

Note that we have written the tensor product in Uj l ® j Xj 2 as '(§>’ rather than ‘®’; 
this is to distinguish this tensor product of representations from the tensor product 
of representations defined in Sect. 5.3. In the earlier definition, we took a tensor 
product of two g representations and produced a third g representation given by 
(n i <g> jt 2 )(X) — 7T i (X) <g> / + 1 <g> 7t 2 (X), where the same element X e g gets fed 
into both u\ and tc 2 \ here, we take a tensor product of two g representations and 
produce a representation of g © g, where two different elements X[ , X 2 e g get fed 
into it i and jt 2 . 

Now, one might wonder if the representation (jx Vj l <8> V jf) of 51(2, C) © 

sl(2, C) defined above is equivalent to (ji , j2) ; this is in fact the case! To prove this, 
recall the following notation: the representation space Vj t ® V J2 is spanned by vec- 
tors of the form <E) , ki =0, ... ,2 jf, i — 1,2 where the are characterized 

by (5.76). Similarly, the representation space Vq ] j 2 ) of (ji, j 2 ) is spanned by vectors 
of the form j< 2 , h, = 0, . . . , 2 , i = 1, 2, where these vectors are characterized 
by (5.87). We can thus define the obvious intertwiner 

(j ) : Vj 2 <g> Vj 2 ->■ V(j l ,j 2 ) 
v ki ® v kl v kuk2 . 

Of course, we must check that this map actually is an intertwiner, i.e. that 

0 O [(Try, <8> n h ){Xu X 2 )] = 7r 0l , /2) (X,, X 2 ) o 0 VX lf X 2 e sl(2, C). (A.7) 

Since sl(2, C) © sl(2, C) is of (complex) dimension six, it suffices to check this for 
the six basis vectors 


a M t , o) 

(0, iN z ) 

(M+,0) = (iM x -M y ,0) 

(M-,0) = (iM x + M y ,0) 

(0 ,Af+) = (0,iAf x -J\fy) 

(0 ,J\f-) = (0,iJV x +J\f y ), 

where the Ad, and Af, were defined in (A.2). We now check (A.7) for (i Ad ; ,0) 
on an arbitrary basis vector v k] <g> v kl , and leave the verification for the other five 
s 1(2, C) © s((2, C) basis vectors to you. The left hand of (A.7) gives (careful with 
all the parentheses!) 
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<p(fjt ji ®nj 2 )(iM z ,0)(vk 1 0 v k2 ))=<p((J z ®I)(v kl 0 v k2 )) 

= </>((;' i -h)v kl ® v k2 ) 

= O'l - &l)v*i,*2 

while the right hand side is 

® %)) = ( 7r (7i,72)0'- A/J --0))(^i,fe) 

= ZZW- ^k\,k2 

— O'l - 

and so they agree. The verification for the other five sl(2, C) ©sl(2, C) basis vectors 
proceeds similarly. This proves the following proposition: 

Proposition A.2 Let (ji,j 2 ) denote both the usual s1(2,(C)r irrep as well as 
its extension to a complex-linear irrep of (sl(2, C)r)c — sl(2, C) © s 1(2, C). Let 
(jtj l ® i tj 2 , Vj l ® Vj 2 ) be the representation of sl(2, C) © sl(2, C) defined in (A. 6). 
Then 


(jl Jl) - (itji 0 tt j2 , V h 0 V j2 ). 


(A. 8) 


Exercise A.4 Verify (A.7) for the other five sl(2, C) © sl(2, C) basis vectors, and thus 
complete the proof of Proposition A.2. 

We will now use Proposition A.2 to compute tensor products of the (sl(2, C)r)c 
irreps (ji, j2)» which by Proposition A.l will give us the tensor products of the 
(ji,j 2 ) as sl(2, C) K irreps, which was what we wanted! In the following computa- 
tion we will use the fact that 


(7Tj l 0 Jtj 2 ) 0 (jt kl 0 7 tk 2 ) ~ (7T/J ®> JT kl ) 0 (j Xj 2 0 JV k2 ), (A. 9) 

which you will prove below (note which tensor product symbols are ‘barred’ and 
which are not). With this in hand, and using the su(2)c — sl(2, C) tensor product 
decomposition (5.68), we have (omitting the representation spaces in the computa- 
tion) 


(jl,j2)0 (ki,k 2 ) 


(ttj i ®ttj 2 ) 0 (Jt kl 0 itk 2 ) 

(nj l 0 n kl ) 0 (jtj 2 0 Jtk 2 ) 

( ji+ki \ / j 2 +k 2 \ 

0 j <0 ( 0 ^ 2 ) 

fi=|ji-*il / \h=\j2-k 2 \ / 

0 7 r h 0 7T/ 2 where 1 7 ) - k t | < /, < j, + k h 7 = 1,2 
(h,h) 

0 (li , 1 2 ) where \j t - k t \ < /; < ji + k t , 7 = 1,2. 

(h,h) 


This gives the tensor product decomposition of complex-linear (s 1(2, C)r)c irreps. 
However, these irreps are just the extensions of the irreps (ji, j 2 ) of sl(2, C)r, and 
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you can check that the process of extending an irrep to the complexification of a Lie 
algebra commutes with taking tensor products, so that the extension of a product is 
the product of an extension. From this we conclude the following: 

Proposition A.3 The decomposition into irreps of the tensor product of two 
sl(2, C) R irreps (jl J 2 ) and (ki, k 2 ) is given by 

(ii.j2)®(ki,k 2 ) = 0(Z 1 ,/ 2 ) 
where \ji -k x \<h< j\ + k x , \j 2 - k 2 \ <l 2 < j 2 + k 2 
which is just Proposition 5.8. 

Exercise A.5 Prove (A.9) by referring to the definitions of both kinds of tensor prod- 
uct representations and by evaluating both sides of the equation on an arbitrary vector 
(Xi, X 2 ) esl(2,<C)®s[(2,C). 
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Anti-Hermitian operators 
Lie algebra of, 127 
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2nd rank tensor representation, 149 
matrices, 16, 116, 118 
tensor, 70 

tensors and Lie algebras of isometry 
groups, 177 
Axial vector, 75 
Axis of rotation, 74, 138 

B 

Baker-Campbell-Hausdorff formula, 1 17 

Basis (for a vector space), 15 

Bilinear, 27 

Bivector, 75 

Body axes, 19 

Body frame, 78 


Boost, 100, 125, 139 
generators, 119 
Bosons, 71, 111 
Bra, 32 

C 

C(P), 128, 158 
C\ 11 

as complexification of R" , 1 94 
Canonical 

transformation, 129 
variables, 131 
Cartesian product, xv 

Chiral representation (of gamma matrices), 
223 

Clebsch-Gordan coefficients, 66, 221 
Clifford algebra, 208 
Closed (group), 112 
Cofactor expansion of determinant, 73 
Commutator, 116 
Completely reducible, 180 
Completeness (of vector space), 29 
Complex Lie algebra, 227 
Complex vector space, 1 1 
Complex-linear, 229 
Complexification 
of Lie algebra, 227 
of representation, 194 
of vector space, 194 
Components 

contravariant, 32, 46 
covariant, 32, 46 
of dual vector, 25 
of matrix, 22 
of Minkowski metric, 29 
of tensor, 4, 40, 55 
of vector, 17 
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Conjugate representation, see representation, 
conjugate 
Conjugate-linear, 27 
Connected group, 98 
Contraction, 56 

Contravariant, see components, contravariant 
Covariant, see components, covariant 
Cross product, 76 
Cyclic permutation, 72 

D 

Decomposable, 180 
Determinant, 73, 82, 83 

and oriented volume, 74, 83 
and sgn homomorphism, 110 
of a matrix exponential, 119 
Dimension (of a vector space), 15 
Dirac bilinear, 224 
Dirac delta functional, 33, 61 
Dirac notation, 32, 40, 57, 61, 66 
Dirac spinor, 147, 151, 183, 205, 222, 224 
and antisymmetric tensors, 225 
Direct sum, 178, 183 
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internal, 183 
of Lie algebras, 229 
representation, 183 
Direct sum decomposition 
of A 2 R 4 , 181 
of L 2 (5 2 ), 182 
of M„(C), 179 
of M„(R), 179 

of tensor product representations, 180 
Disconnected group, 98 
Dot product, 28 
Double cover 

of 50(3) by 5(7(2), 107 
of 50(3, 1) 0 by 5L(2,C), 108 
Double dual, 35 
Dual 

basis, 26 
metric, 36 

representation, 161, 162 
space, 25 

vector, see linear functional 

E 

6 , see Levi-Civita tensor 

Einstein summation convention, 39, 46 

Electromagnetic field tensor, 60 

transformation properties of, 149, 181, 203 
Entanglement, 67 

Equivalence of representations, 169 
Euler angles, 97 


Euler’s theorem, 138 
Exponential of matrix, 114 

F 

Faithful representation, 210 
Fermions, 71, 111 
Four- vector, 147 

Four- vector representation, see representation, 
four-vector 

Fourier coefficients, 20 
Fourier series, 20 
Fourier transform, 62 
Fundamental representation, see 

representation, fundamental 

G 

GL(n , R) , see general linear group 
GL(V), see general linear group 
0 l(n, C),115 
0 l(O, 127 

Gamma matrices, 222, 223 
chiral representation of, 223 
General linear group, 14, 90, 115 
Generators, 1 1 1 
Gram-Schmidt, 27 
Group, 81 
axioms, 88 
definition of, 88 

H 

H n (C) , see Hermitian matrices 
n,( r 3 ), see harmonic polynomials 
7 -Li , see spherical harmonics 
Hamilton’s equations, 158 
Harmonic polynomials, 12, 16, 23, 34, 48, 69, 
155, 192 

Heisenberg algebra, 130, 157, 160 
Heisenberg equation of motion, 2 1 
Heisenberg picture, 52 
Hermite polynomials, 37 
Hermitian adjoint, 35 
Hermitian matrices, 11, 16 
Hermitian scalar product, 28 
Hermiticity, 27 

Highest weight vector, 189, 197 
Hilbert space, 17, 29 
Hilbert-Schmidt norm (on M„(C)), 1 17 
Homomorphism 
group, 103 
Lie algebra, 131 

I 

Identical particles, 71, 150 
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raising and lowering, 32 
Inner product, 27 
on C" , 28 
on L 2 ([—a, a]), 29 
on M„(C), 28 
on R" , 28 

Inner product space, 27 
Intertwiner, 169 

Intertwining map, see intertwiner 
Invariant subspace, 179 
non-trivial, 179 
Inversion, 75, 98 
Invertibility, 22, 83 

Irreducible representation, 180, 184, 185 
of a real Lie algebra and its 
complexification, 230 
of abelian group, 187 
of double-cover of 0(3, 1), 205 
of 0(3, 1), 206-208 
of SL(2, C), 201 
of 50(3), 192 
of 50(3, 1) 0 , 201 
of <s[(2, €),230 
of sl(2, C)r, 196-201 
of st(2, €) e sl(2, C), 230-232 
of su(2), 188-190 
of Z 2 , 187 

Irrep, see irreducible representation 
Isometry, 90, 104 
Isomorphism 
group, 103 
Lie algebra, 131 

J 

J+, 188, 191, 192 
188, 191, 192, 194 
J z , 188, 191, , 192, 194 
Jacobi identity, 116, 133 

K 

Kernel, 105 
Ket, 32 

Killing form, 137, 143 
Kinetic energy of rigid body, 6 

L 

L 2 ([-a, a]), 12, 16, 20, 28, 32, 60 

L 2 ( R), 12, 60 

L 2 (R 3 ), 64, 160 

L 2 (5 2 ), 155, 182 

C{V), see operators, linear 

Laguerre polynomials, 37 

Left-handed spinor, see representation, spinor 


Legendre polynomials, 37 
Levi-Civita symbol, see Levi-Civita tensor 
Levi-Civita tensor, 41, 72, 1 10, 208, 212 
Lie, Sophus, 1 1 1 
Lie algebra. 111, 115-131 
abstract, 127 
complex, 227 
definition, 115 
direct sum, 229 
physicist’s definition, 119 
Lie algebra homomorphism, 131 
induced, 133 

Lie algebra representation, 145 
Lie bracket 

and similarity transformation, 137 
Lie group, 1 1 1-113 
dimension of, 112 
Lie product formula, 116, 134 
Lie subalgebra, 1 28 
Linear combinations, 14 
Linear dependence, independence, 15 
Linear functional, 25 
Linear map, 104 
Linear operator 

as (1, 1) tensor, 39, 57 
definition of, 2 1 
Lie algebra of, 127 
Linearity, 21 
Lorentz force law, 60 
Lorentz group, 93, 99-102 
Lorentz transformations, 49, 99 

decomposition into boost and rotation, 139 

M 

M„ (C),227 

as complexification of M„(R), 194 
M n (R) , M n (C) , 11, 16 
Matrix 

exponential, 114 
logarithm, 142 
of linear operator, 23 
of Minkowski metric, 29 
Matrix Lie group, 1 1 2 
Maxwell stress tensor, 59 
Metric, 27 

as intertwiner between V and V* , 173 
Minkowski metric, 28, 93 
Moment of inertia tensor, 6, 42, 58 
Momentum representation, 62 
Multilinearity, 3, 39 

Multiplicative quantum number, 160, 161 
Multipole moments, 42, 69 
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Neutrino, 147 
Noether's theorem, 129 
Non-abelian (group), 88 
Non-commutative (group), 88 
Non-degeneracy, 27 
Non-degenerate Hermitian form, 27 
Null space, 106 
Nullity, 106 

O 

0(3), 98, 115 
0(3, 1), 101, 138 
o(3), 115 
o (n - 1, 1), 119 
o (n), 118 

0(n — 1, 1), see Lorentz group 
0(n), see orthogonal matrices, orthogonal 
group 

Observables, 128 
One-parameter subgroup, 114 
One-to-one, 22 
Onto, 22 
Operators 

adjoint of, 35 

angular momentum, 21,23 
exponential of, 34 
Hermitian, 36 
invertible, 22 
linear, see linear operator 
matrix representation of, 23 
self-adjoint, 36 
symmetric, 36 
Orientation, 74, 83 
Orthogonal 

complement, 210 
matrices, 47, 81 
projection operator, 217, 225 
set, 28 

Orthogonal group, 91, 96-98 

proof that it is a matrix Lie group, 112 
Orthonormal basis, 27 
for Hilbert space, 30 

P 

Pi(C 2 ), 152, 185, 191 
Pi( R 3 ), see polynomials, of degree / 
p, 21, 60, 62, 130 
Parity, 101, 160, 188 
and Dirac spinors, 183 
and 7,2, 109 

Passive transformation, 50, 81 
Pauli matrices, 1 2 


Permutation 

even, odd, 70, 110 
Permutation group, 95, 150 
relation to Z 2 , 109 
Permutations, see permutation group 
Phase space, 128 
Pin groups, 208 
Poisson bracket, 128 

formulation of mechanics, 158 
Polar decomposition theorem, 140 
Polynomials, 12 

harmonic, see harmonic polynomials 

Hermite, 37 

Laguerre, 37 

Legendre, 37 

real, 36 

Positive matrix, 140 
Positive-definite, 27 
Principal minors, 140 
Product 

of matrices, 23 
Product state, 67 
Pseudoscalar, 167, 168 
Pseudovector, 75, 149, 168, 182 

Q 

q, 130 

Quark, antiquark, 173 

R 

Rank (of map), 106 
Rank (of tensor), 3, 40 
Rank-nullity theorem, 106 
Rapidity, 100 

Real numbers (as additive group), 89 
Real vector space, 1 1 
Real-linear, 229 
Representation, 128, 145 

2nd rank antisymmetric tensor, 149, 175, 
203 

2nd rank antisymmetric tensor and adjoint 
of O(n), 166 
adjoint, 148 
adjoint of C(P), 158 
alternating, 150 
complex, 145 
conjugate, 209 
dual, 161, 162, 173 
equivalence of, 169 
faithful, 210 

four- vector, 147, 151, 201 
fundamental, 147 
irreducible, 180 

of Heisenberg algebra on L 2 (R), 157 
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Representation ( cont . ) 
of 5„, 150 
of Z 2 , 150 

on function spaces, 151 
on linear operators, 1 63 
on symmetric and antisymmetric tensors, 
165-169 
on T/(V), 161 
pseudoscalar, 168 
pseudovector, 168 
real, 145 
scalar, 166 
sgn, 150 

50(2) onR 2 , 187, 196 

50(3) on W,(R 3 ), Hi and L 2 (S 2 ), 155 

50(3) on L 2 (R 3 ), 153 

space, 145 

spin j, 152, 160 

spin-one, 195 

spin-two, 195 

spinor, 147, 151, 174, 198, 199, 205, 222 

5(7(2) on P/(C 2 ), 152 

symmetric traceless tensors, 195 

tensor product, 159 

trivial, 146 

unitary, 146 

vector, 147 

and “spin-one”, 194 
Representation operator, 2 1 6 
Right-handed spinor, see representation, spinor 
Rigid body motion, 19, 52, 78 
Rotation, 74, 96-98, 129, 138 
generators, 118, 119, 121, 122 
improper, 98 
proper, 98 

S 

5'(C 2 ), 165, 192, 199,200 
5„ , see permutation group 
S r (V), see symmetric, tensors 
5L(2,C), 102, 151 

relationship to 50(3, 1) 0 , 108, 135 
50(2), 96 

50(3), 97, 106, 129 

infinitesimal elements of, 113 
50(3, 1)„, 99, 138 
SO(n ), see special orthogonal group 
5(7(2), 98 

relationship to 50(3), 106, 135, 138 
5(7(3), 173 

5(7(«), see special unitary group 
s((2,€)r, 125 

relationship to so (3, 1), 136 
sl(n, C),227 


so(2), 121 
so(3), 122, 129 
so(3, 1), 124 
so (n), 120 
su(2), 106, 124 

isomorphic to so(3), 135 
su(h), 120 

Scalar representation, see representation, scalar 
Scalars, 9 

Schrodinger picture, 52 
Schur's lemma, 186 
Selection rule, 218 
parity, 219 
Self-adjoint, 36 

Semi-simple (group or algebra), 180 

Separable state, 67 

Sgn homomorphism, 110 

Sgn representation, 150 

Similarity transformation, 48, 164 

Space axes, 19 

Space frame, 78 

Span (of a set of vectors), 14 

Special linear group, 102 

Special orthogonal group, 94, 138 

Special unitary group, 94, 98 

Spectral theorem, 156 

Spherical harmonics, 12, 16, 23, 34, 155, 182 
Spherical Laplacian, 13, 156 
Spherical tensor, 214, 216 
Spin, 1 1, 64 

Spin angular momentum, 65 
Spin-one representation, see representation, 
spin-one 

Spin-two representation, see representation, 
spin-two 

Spinor, see representation, spinor 
Spinor representation, see representation, 
spinor 

Square-integrable, 12, 156 
Star operator, 212 
Structure constants, 132 
Subgroup, 90 
Subspace, 1 1 
Symmetric 
form, 27 
matrices, 16 
tensors, 68 

Symmetric group, see permutation group 
Symmetrization postulate. 111, 150 
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T 

ms, 23 

T s r , see tensors, of type (r, s) 

Tensor operator, 214 
Tensor product 

as addition of degrees of freedom, 63-67, 
160 

of operators, 63 
of sl(2, C)r irreps, 202, 233 
of su(2) irreps, 181 
of vector spaces, 54 
of vectors, 54 
representation, 159 
Tensors 

alternating, 70 
and linear operators, 5, 57 
and matrices, 5 
antisymmetric, 70 
as linear operators, 39 
as tensor product space, 55 
basis for vector space of, 55 
components of, 4, 40, 55 
contraction of, 56 
definition of, 39 
rank of, 40 
symmetric, 68 

transformation law, 5, 43-53 
type (r, s), 39, 55 
Time-reversal, 102 
and Z 2 , 109 
Trace, 26, 48 

cyclic property of, 121 
interpretation of, 120 
Transformation law 

of linear operators, 48, 163 
of metric tensors, 48 
of vectors and dual vectors, 46, 162 
Translations, 130 
Transpose, 35 
Transposition, 68, 109 
Trivial representation, see representation, 
trivial 


U 

17(1), 105 

U(n), see unitary group 
u(n), 118 
Unitary 
group, 92 
matrices, 47 
operator, 92 
representation, 146 

V 

V*, see dual space 
Vector operators, 63, 213, 214 
Vector representation, see representation, 
vector 

Vector space 

as additive group, 90 
axioms, 9 
complex, 1 1 
definition of, 9 
isomorphism, 105 
real, 1 1 

W 

Wedge product, 70, 1 10 
Weyl spinor, 147 

Wigner-Eckart theorem, 219-222 
X 

x, 21, 60 

Y 

Y l m , see spherical harmonics 

Z 

Z 2 , 95, 150 

and parity, time-reversal, 109 
and sgn homomorphism, 109 
irreducible representations of, 187 
is semi-simple, 187, 210 



